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^ designing new alpha amylases and methods of use thereof are also provided. The alpha amylases have increased activity and stability 
1^ at acidic, and alkaline pH and increased temperature. 



wo 02/068589 



PCT/US02/05068 



5 ENZYMES HAVING ALPHA AMYLASE ACTIVITY 

AND METHODS OF USE THEREOF 

RELATED APPLICATION DATA 

This application claims priority of U.S. Provisional Application No. 
1 0 60/270,495, filed February 2 1 , 2001 , now pending; U.S. Provisional Application No. 
60/270,496, filed February 21, 2001, now pending; and U.S. Provisional Application 
No. 60/291,122, filed May 14, 2001, now pending, all of which are herein 
incorporated by reference in their entirety. 



15 nELD OF THE INVENTION 

This invention relates generally to enzymes, polynucleotides encoding the 
enzymes, the use of such polynucleotides and polypeptides, and more specifically to 
enzymes having alpha amylase activity. 



20 BACKGROUND 

Starch is a complex carbohydrate often found in the human diet. The structure 
of starch is glucose polymers linked by a-1,4 and a-1,6 glucosidic bonds. Amylase is 
an enzyme that catalyzes the hydrolysis of starches into sugars. Amylases hydrolyze 

25 intemal a-l,4-glucosidic linkages in starch, largely at random, to produce smaller 
molecular weight malto-dextrins. The breakdown of starch is important in the 
digestive system and commercially. Amylases are of considerable commercial value, 
being used in the initial stages (liquefaction) of starch processing; in wet com milling; 
in alcohol production; as cleaning agents in detergent matrices; in the textile industry 

30 for starch desizing; in baking applications; in the beverage industry; in oilfields in 
drilling processes; in inking of recycled paper; and in animal feed. 

Amylases are produced by a wide variety of microorganisms including 
Bacillus and Aspergillus, with most commercial amylases being produced fi:om 
bacterial sources such as Bacillus licheniformiSy Bacillus amyloliquefaciens, Bacillus 
35 subtiliSy or Bacillus stearothermophilus. In recent years, the enzymes in commercial 
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5 use have been those from Bacillus licheniformis because of their heat stability and 
performance, at least at neutral and mildly alkaline pHs. 

In general, starch to fructose processing consists of four steps: liquefaction of 
granular starch, saccharification of the liquefied starch into dextrose, purification, and 
isomerization to fiiictose. The object of a starch liquefaction process is to convert a 

10 concentrated suspension of starch polymer granules into a solution of soluble shorter 
chain length dextrins of low viscosity. This step is essential for convenient handling 
with standard equipment and for efficient conversion to glucose or lO^other sugars. 
To liquefy granular starch, it is necessary to gelatinize the granules by raising the 
temperature of the granular starch to over about 72°C. The heating process 

15 instantaneously disrupts the insoluble starch granules to produce a water soluble 

starch solution. The solubilized starch solution is then liquefied by amylase. A starch 
granule is composed of: 69-74% amylopectin, 26-31% amylose, 1 1-14% water, 0.2- 
0.4% protein, 0.5-0.9% lipid, 0.05-0.1% ash, 0.02-0.03% phosphorus, 0.1% pentosan. 
Approximately 70% of a. granule is amorphous and 30% is crystalline. 

20 A common enzymatic liquefaction process involves adjusting the pH of a 

granular starch slurry to between 6.0 and 6.5, the pH optimum of alpha-amylase 
derived from Bacillus licheniformis, with the addition of calcium hydroxide, sodiiun 
hydroxide or sodium carbonate. The addition of calcium hydroxide has the advantage 
of also providing calcium ions which are known to stabilize the alpha-amylase against 

25 inactivation. Upon addition of alpha-amylase, the suspension is pimiped through a 
steam jet to instantaneously raise the temperature to between 80 degree-1 1 5 degrees 
C. The starch is inunediately gelatinized and, due to the presence of alpha-amylase, 
depolymerized through random hydrolysis of a (M) glycosidic bonds by alpha- 
amylase to a fluid mass which is easily pumped. 

30 In a second variation to the liquefaction process, alpha-amylase is added to the 

starch suspension, the suspension is held at a temperature of 80-100 degrees C to 
partially hydrolyze the starch granules, and the partially hydrolyzed starch suspension 
is pumped through a jet at temperatures in excess of about 105 degrees C to 
thoroughly gelatinize any remaining granular structure. After cooling the gelatinized 
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S starch, a second addition of alpha-amylase can be made to further hydrolyze the 
starch. 

A third variation of this process is called the dry milling process. In dry 
milling, whole grain is ground and combmed with water. The germ is optionally 
removed by flotation separation or equivalent techniques. The resulting mixture, 
10 which contains starch, fiber, protein and other components of the grain, is liquefied 
using .alpha. -amylase. The general practice in the art is to undertake enzymatic 
liquefaction at a lower temperature when using the dry milling process. Generally, 
low temperature liquefaction is believed to be less efficient than high temperature 
liquefaction in converting starch to soluble dextrins. 

1 5 Typically, after gelatinization the starch solution is held at an elevated 

temperature in the presence of alpha-amylase until a DE of 10-20 is achieved, usually 
a period of 1-3 hours. Dextrose equivalent (DE) is the industry standard for measuring 
the concentration of total reducing sugars, calculated as D-glucose on a dry weight 
basis. Unhydrolyzed granular starch has a DE of virtually zero, whereas the DE of D- 

20 glucose is defined as 100. 

Com wet milling is a process which produces com oil, gluten meal, gluten * 
feed and starch. Alkaline-amylase is used in the liquefaction of starch and 
glucoamylase is used in saccharification, producing glucose. Com, a kernel of which 
consists of a outer seed coat (fiber), starch, a combination of starch and glucose and 

25 the inner germ, is subjected to a four step process, which results in the production of 
starch. The com is steeped, de-germed, de-fibered, and finally the gluten is separated. 
In the steeping process, the solubles are taken out. The product remaining after 
removal of the solubles is de-germed, resulting in production of com oil and 
production of an oil cake, which is added to the solubles fit)m the steeping step. The 

30 remaining product is de-fibered and the fiber solids are added to the oil cake/solubles 
mixture. This mixture of fiber solids, oil cake and solubles forms a gluten feed. After 
de-fibering, the remaining product is subjected to gluten separation. This separation 
results in a gluten meal and starch. The starch is then subjected to liquefaction and 
saccharification to produce glucose. 
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5 Staling of baked products (such as bread) has been recognized as a problem 

which becomes more serious as more time lies between the moment of preparation of 
the bread product and the moment of consumption. The term staling is used to 
describe changes undesirable to the consumer in the properties of the bread product 
after leaving the oven, such as an increase of the firmness of the crumb, a decrease of: 
10 the elasticity of the crumb, and changes in the crust, which becomes tough and 
leathery.The firmness of the bread crumb increases fiuther during storage up to a 
level, which is considered as negative. The increase in crumb finnness, which is 
considered as the most important aspect of staling, is recognized by the consumer a 
long time before the bread product has otherwise become unsuitable for consumption. 

1 5 There is a need in the industry for the identification and optimization of 

amylases, usefiil for various uses, including commercial cornstarch liquefaction 
processes. These second generation acid amylases will offer improved manufacturing 
and/or performance characteristics over the industry standard enzymes from Bacillus 
licheniformis^ for example. 

20 There is also a need for the identification and optimization of amylases having 

utility in automatic dish wash (ADW) products and laundry detergent. In ADW 
products, the amylase will function at pH 10-1 1 and at 45-60 °C in the presence of 
calcium chelators and oxidative conditions. For laundry, activity at pH 9-10 and 40 ^'C 
in the appropriate detergent matrix will be required. Amylases are also useful in 

25 textile desizing, brewing processes, starch modification in the paper and pulp industry 
and other processes described in the art. 

The publications discussed herein are provided solely for their disclosure prior 
to the filing date of the present application. Nothing herein is to be construed as an 
admission that the invention is not entitled to antedate such disclosure by virtue of 
30 prior invention. 
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5 SUMMARY OF THE INVENTION 

The invention provides an isolated nucleic acid having a sequence as set forth 
in SEQ ID Nos.: 1, 3, 5, 9, 11. 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 
43, 45 , 47, 49, 51, 53, 55, 57, 59, 61, 63. 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 
89, 91,93, 95,97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123. 

10 125, 127, 129, 131. 133. 135, 137, 139, 141, 143, 145, 147, 149. 151, 153, 155, 157, 
159, 161, 163. 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 
193, 195, 197. 199, 201. 203, 205. 207, 209, 211, 213, 215. 217, 219, 221, 223, 225, 
227, 229, 231, 233, 235, 237, 239. 241. 243, 245, 247, 249, 251, 253, 255, 257, 259, 
261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 

1 5 295, 297, 299 and variants thereof having at least 50% sequence identity to SEQ ID 
Nos.: 1, 3, 5, 9, 11. 13, 15, 17, 19, 21. 23. 25. 27, 29, 31, 33, 35, 37, 39, 41. 43. 45. 
47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71. 73, 75. 77, 79, 81, 83, 85, 87, 89, 91, 
93, 95,97,99, 101, 103, 105, 107, 109, 111, 113. 115, 117. 119. 121. 123, 125, 127, 
129, 131, 133, 135. 137. 139. 141, 143, 145, 147, 149, 151. 153. 155. 157, 159, 161, 

20 163, 165, 167, 169, 171, 173, 175, 177, 179, 181. 183. 185. 187. 189. 191, 193, 195, 
197, 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 
231, 233, 235, 237, 239, 241, 243, 245, 247. 249, 251, 253, 255, 257, 259, 261, 263, 
265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 
299 and encoding polypeptides having alpha amylase activity. 

25 One aspect of the invention is an isolated nucleic acid having a sequence as set 

forth in SEQ ID Nos: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 
39, 41, 43, 45. 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67. 69. 71, 73, 75, 77, 79, 81, 83, 
85, 87. 89. 91. 93, 95,97, 99, 101, 103. 105, 107, 109. 111. 113, 115. 117. 119. 121, 
123. 125. 127. 129. 131. 133. 135. 137, 139, 141, 143, 145, 147, 149, 151, 153. 155, 

30 157. 159. 161. 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183. 185, 187. 189, 
191, 193, 195, 197, 199, 201, 203, 205, 207, 209. 211, 213, 215, 217, 219, 221, 223, 
225, 227. 229. 231. 233, 235. 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 
259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279. 281, 283, 285, 287, 289, 291, 
293, 295, 297, 299 (hereinafter referred to as "Group A nucleic acid sequences"), 

35 sequences substantially identical thereto, and sequences complementary thereto. 
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5 Another aspect of the invention is an isolated nucleic acid including at least 10 

consecutive bases of a sequence as set forth in Group A nucleic acid sequences, 
sequences substantially identical thereto, and the sequences complementary thereto. 

In yet another aspect, the invention provides an isolated nucleic acid encoding 
a polypeptide having a sequence as set forth in SEQ ID Nos.: 2, 4, 6, 10, 12, 14, 16, 

10 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62. 
64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 
174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 

15 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258. 260, 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298 and variants thereof encoding a 
polypeptide having alpha amylase activity and having at least 50% sequence identity 
to such sequences. 

20 Another aspect of the invention is an isolated nucleic acid encoding a 

polypeptide or a functional fragment thereof having a sequence as set forth in SEQ ID 
No.s: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22. 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 
94,96,98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 

25 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 
164, 166, 168, 170. 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 
198, 202. 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 
234, 236, 238. 240. 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 
268, 270, 272. 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294. 296, 298 

30 (hereinafter referred to as *'Group B amino acid sequences"), and sequences 
substantially identical thereto. 



Another aspect of the invention is an isolated nucleic acid encoding a 
polypeptide having at least 10 consecutive amino acids of a sequence as set forth in 
Group B amino acid sequences, and sequences substantially identical thereto. 
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5 In yet another aspect, the invention provides a purified polypeptide having a 

sequence as set forth in Group B amino acid sequences, and sequences substantially 
identical thereto. 

Another aspect of the invention is an isolated or purified antibody that 
specifically binds to a polypeptide having a sequence as set forth in Group B amino 
10 acid sequences, and sequences substantially identical thereto. 

Another aspect of the invention is an isolated or purified antibody or binding 
fi-agment thereof, which specifically binds to a polypeptide having at least 10 
consecutive amino acids of one of the polypeptides of Group B amino acid sequences, 
and sequences substantially identical thereto. 

1 S Another aspect of the invention is a method of making a polypeptide having a 

sequence as set forth in Group B amino acid sequences, and sequences substantially 
identical thereto. The method includes introducing a nucleic acid encoding the 
polypeptide into a host cell, wherein the nucleic acid is operably linked to a promoter, 
and culturing the host cell under conditions that allow expression of the nucleic acid. 

20 Another aspect of the invention is a method of making a polypeptide having at 

least 10 amino acids of a sequence as set forth in Group B amino acid sequences, and 
sequences substantially identical thereto. The method includes introducing a nucleic 
acid encoding the polypeptide into a host cell, wherein the nucleic acid is operably 
linked to a promoter, and culturing the host cell under conditions that allow 

25 expression of the nucleic acid, thereby producing the polypeptide. 

Another aspect of the invention is a method of generating a variant including 
obtaining a nucleic acid having a sequence as set forth in Group A nucleic acid 
sequences, sequences substantially identical thereto, sequences complementary to the 
sequences of Group A nucleic acid sequences, firagments comprising at least 30 
30 consecutive nucleotides of the foregoing sequences, and changing one or more 

nucleotides in the sequence to another nucleotide, deleting one or more nucleotides in 
the sequence, or adding one or more nucleotides to the sequence. 
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5 Another aspect of the invention is a computer readable medium having stored 

thereon a sequence as set forth in Group A nucleic acid sequences, and sequences 
substantially identical thereto, or a polypeptide sequence as set forth in Group B 
amino acid sequences, and sequences substantially identical thereto. 

Another aspect of the invention is a computer system including a processor 
10 and a data storage device wherein the data storage device has stored thereon a 

sequence as set forth in Group A nucleic acid sequences, and sequences substantially 
identical thereto, or a polypeptide having a sequence as set forth in Group B amino 
acid sequences, and sequences substantially identical thereto. 

Another aspect of the invention is a method for comparing a first sequence to a 
IS reference sequence wherein the first sequence is a nucleic acid having a sequence as 
set forth in Group A nucleic acid sequences, and sequences substantially identical 
thereto, or a polypeptide code of Group B amino acid sequences, and sequences 
substantially identical thereto. The method includes reading the first sequence and the 
reference sequence through use of a computer program which compares sequences; 
20 and determining differences between the first sequence and the reference sequence 
with the computer program. 

Another aspect of the invention is a method for identifying a feature in a 
sequence as set forth in Group A nucleic acid sequences, and sequences substantially 
identical thereto, or a polypeptide having a sequence as. set forth in Group B amino 
25 acid sequences, and sequences substantially identical thereto, including reading the 
sequence througjh the use of a computer program which identifies features in 
sequences; and identifying features in the sequence with the computer program. 

Another aspect of the invention is an assay for identifying fragments or 
variants of Group B amino acid sequences, and sequences substantially identical 
30 thereto, which retain the enzymatic function of the polypeptides of Group B amino 
acid sequences, and sequences substantially identical thereto. The assay includes 
contacting the polypeptide of Group B amino acid sequences, sequences substantially 
identical thereto, or polypeptide firagment or variant with a substrate molecule under 
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5 conditions which allow the polypeptide fragment or variant to function, and detecting 
either a decrease in the level of substrate or an increase in the level of the specific 
reaction product of the reaction between the polypeptide and substrate thereby 
identifying a fragment or variant of such sequences. 

The invention also provides a process for preparing a dough or a baked 
10 product prepared from the dough which comprises adding an amylase of the invention 
to the dough in an amount which is effective to retard the staling of the bread. The 
invention also provides a dough comprising said amylase and a premix comprising 
flour together with said amylase. Finally, the invention provides an enzymatic baking 
additive, which contains said amylase. 

1 5 The use of the amylase in accordance with the present invention provides an 

improved anti-staling effect as measured by, e.g. less crumb firming, retained crumb 
elasticity, improved slice-ability (e.g. fewer crumbs, non-gummy crumb), improved 
palatability or flavor. 

BRIEF DESCRIPTION OF THE DRAWINGS 
20 The following drawings are illustrative of embodiments of the invention and 

are not meant to limit the scope of the invention as encompassed by the claims. 

Figure 1 is a block diagram of a computer system. 

Figure 2 is a flow diagram illustrating one embodiment of a process for 
comparing a new nucleotide or protein sequence with a database of sequences in order 
25 to determine the homology levels between the new sequence and the sequences in the 
database. 

Figure 3 is a flow diagram illustrating one embodiment of a process in a 
computer for determining whether two sequences are homologous. 

Figure 4 is a flow diagram illustrating one embodiment of an identifier 
30 process 300 for detecting the presence of a feature in a sequence. 
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5 Figure 5 is a graph showing the Residual activity of various amylases 

following heating to 90^C for 10 min in Example 1 . 

Figure 6 is a graph showing the net percent starch removed versus enzyme 
concentration in ADW wash test with bleach and chelators. 

Figure 7 is a graph showing the activity of parental amylases at pH 8, 40^C in 
1 0 ADW formulation at 55^C. 

Figure 8 is a graph of data regarding the H2O2 tolerance of the novel enzymes 
in Example 4. 

Figure 9 is a graph of the pH and temperature data for a selection of the 
amylases characterized. Figure 9a shows the data at pH 8 and 40**C and Figure 9b 
1 5 shows the data at pH 10 and 50X. 

Figure 10 sets forth the sequences to be used in reassembly experiments with 
the enzymes. 

Figure 11 illustrates a sample Standard Curve of the assay of Example S. 

Figure 12 illustrates the pH rate profiles for SEQ ID NO.: 127, which has a 
20 neutral optimum pH and SEQ ID NO.: 211, which has an optimum around pH 10. 

Figure 13 shows the stability of Di versa amylases vs. a commercial enzyme, 
as discussed in Example 2. 

Figure 14 shows the sequence alignments of hypothermophilic a-amylases, as 
set forth in Example 8. Figure 14a shows shows an alignment of amylase sequences. 

25 SEQ ID NO.: 81= an environmental clone; pyro = Pyrococcus sp. (strain:KODl), 

Tachibana,Y., Mendez,L., Takagi,M. and Imanaka,T., J Ferment. Bioeng. 82:224-232, 
1996; pyro2 = Pyrococcus furiosus, Appl. Environ. Microbiol. 63 (9):3569-3576, 
1997; Thermo = Thermococcus sp.; Thermo2 = Thermococcus hydrothermalis, 
Leveque,E. etaL Patent: France 98.05655 05-MAY-1998. unpublished. Figure 14b 

30 shows the amino acid sequence alignment of identified sequences: SEQ ID NO. : 8 1 ; 
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5 pyro; SEQ ID NO.:75; SEQ ID NO.: 77; SEQ ID NO.: 83; SEQ ID NO.: 85; 

thenno2; SEQ ID NO.: 79 ; thermo ; pyro2 ; clone A; thermoS, Figure 14c shows the 
nucleic acid sequence alignment corresponding to the polypeptide sequence of 
Figures 5 and 6. SEQ ID NO.: 81-; SEQ ID NO.:75; SEQ ID NO.: 77; SEQ ID NO.: 
83; SEQ ID NO.: 85; SEQ ID NO.: 79; clone A; and SEQ ID NO.: 73. 

10 Figure 15 is a neighbor-joining tree for Thermococcales, 

Figure 16 is the sequences of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to amylases and polynucleotides encoding them. 
As used herein, the term ^'amylase*' encompasses enzymes having alpha amylase 

1 5 activity, for example, alpha amylases capable of hydrolyzing internal a-1 ,4-glucan 
links in polysaccharides, including amylase enzymes capable of hydrolyzing starch to 
sugars at alkaline pHs or at acidic pHs. Amylases of the invention are particularly 
useful in corn-wet milling processes, detergents, baking processes, beverages and in 
oilfields (fuel ethanol). Amylases are also useful in textile desizing, brewing 

20 processes, starch modification in the paper and pulp industry and other processes 
described in the art. 

The polynucleotides of the invention have been identified as encoding 
polypeptides having alpha amylase or alkaline amylase activity. Alkaline amylases of 
the invention may include, but are not limited to: SEQ ID NO.: 115, SEQ ID 
25 NO.:207, SEQ ID NO.: 139, SEQ ID NO.:127, SEQ ID NO.: 137, SEQ ID N0.:1 13, 
SEQ ID NO.:205, SEQ ID NO.: 179, SEQ ID NO.: 151, SEQ ID NO.: 187. SEQ ID 
NO.:97, SEQ ID NO.: 153, SEQ ID NO.: 69, SEQ ID NO.: 135, SEQ ID NO.: 189, 
SEQ ID NO.: 1 19, SEQ ID NO: 209 and SEQ ID NO: 21 1 . 

Alterations in properties which may be achieved in variants of the invention 
30 are alterations in, e.g., substrate specificity, substrate binding, substrate cleavage 
pattern, thermal stability, pH/activity profile, pH/stability profile [such as increased 
stability at low (e.g. pH<6, in particular pH<5) or high (e.g. pH>9) pH values], 
stability towards oxidation, Ca^*^ dependency, specific activity, and other properties of 
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S interest. For instance, the alteration may result in a variant which, as compared to the 
parent amylase, has a reduced Ca^* dependency and/or an altered pH/activity profile. 

The present invention relates to alpha amylases and polynucleotides encoding 
them. As used herein, the term "alpha amylase'* encompasses enzymes having alpha 
amylase activity, for example, enzymes capable of hydrolyzing starch to sugars. 
1 0 Unlike many known amylases, the amylases of the invention may not be calcium- 
dependent enzymes. 

It is highly desirable to be able to decrease the Ca2+ dependency of an alpha 
amylase. Accordingly, one aspect of the invention provides an amylase enzyme that 
has a decreased Ca2+ dependency as compared to commercial or parent amylases. 

1 5 Decreased Ca2+ dependency will in general have the functional consequence that the 
variant exhibits a satisfactory amylolytic activity in the presence of a lower 
concentration of calcium ion in the extraneous medium than is necessary for a 
commercial or parent enzyme. It will further often have the consequence that the 
variant is less sensitive to calcium ion-depleting conditions such as those obtained in 

20 media containing calcium-complexing agents (such as certain detergent builders). 

''Liquefaction" or "liquefy" means a process by which starch is converted to 
shorter chain and less viscous dextrins. Generally, this process involves gelatinization 
of starch simultaneously with or followed by the addition of alpha amylase. In 
commercial processes, it is preferred that the granular starch is derived torn a source 
25 comprising com, wheat, milo, sorghum, rye or bulgher. However, the present 

invention applies to any grain starch source which is useful in liquefaction, e.g., any 
other grain or vegetable source known to produce starch suitable for liquefaction. 

"Granular starch" or "starch granules" means a water-insoluble component of 
edible grains which remains after removal of the hull, fiber, protein, fat, germ, and 
30 solubles through the steeping, mechanical cracking, separations, screening, 
countercurrent rinsing and centrifugation steps typical of the grain wet-milling 
process. Granular starch comprises mtact starch granules containing, almost 
exclusively, packed starch molecules (i.e., amylopectin and amylose). In com, the 



wo 02/068589 



PCT/US02/05068 



13 

5 granular starch component comprises about 99% starch; the remaining 1 % being 
comprised of protein, fat, ash, fiber and trace components tightly associated with the 
granules. The packing structure of granular starch severely retards the ability of 
.alpha.-amylase to hydrolyze starch. Gelatinization of the starch is utilized to disrupt 
the granules to form a soluble starch solution and facilitate enzymatic hydrolysis. 

10 "Starch solution" means the water soluble gelatinized starch which results 

from heating granular starch. Upon heating of the granules to above about 72 degrees 
C, granular starch dissociates to form an aqueous mixture of loose starch molecules. 
This mixture comprising, for example, about 75% amylopectin and 25% amylose in 
yellow dent com forms a viscous solution in water. In commercial processes to form 

1 5 glucose or fructose, it is the starch solution which is liquefied to form a soluble 
dextrin solution, "alpha amylase" means an enzymatic activity which cleaves or 
hydrolyzes the alpha (1-4) glycosidic bond, e.g., that in starch, amylopectin or 
amylose polymers. Suitable alpha amylases are the naturally occurring alpha amylases 
as well as recombinant or mutant amylases which are useful in liquefaction of starch. 

20 Techniques for producing variant amylases having activity at a pH or temperature, for 
example, that is different from the wild-type amylase, are included herein. 

The temperature range of the liquefaction is generally any liquefaction 
temperature which is known to be effective in liquefying starch. Preferably, the 
temperatiu-e of the starch is between about 80 degrees C to about 1 15 degrees C, more 
25 preferably from about 100 degrees C to about 1 10 degrees C, and most preferably 
from about 105 degrees C to about 108 degrees C. 

In one embodiment, the signal sequences of the invention are identified 
following identification of novel amylase polypeptides. The pathways by which 
proteins are sorted and transported to their proper cellular location are often referred 
30 to as protein targeting pathways. One of the most important elements in all of these 
targeting systems is a short amino acid sequence at the amino terminus of a newly 
synthesized polypeptide called the signal sequence. This signal sequence directs a 
protein to its appropriate location in the cell and is removed during transport or when 
the protein reaches its final destination. Most lysosomal, membrane, or secreted . 
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5 proteins have an amino-terminal signal sequence that marks them for translocation 
into the lumen of the endoplasmic reticulum. More than 100 signal sequences for 
proteins in this group have been determined. The sequences vary in length from 13 to 
36 amino acid residues. Various methods of recognition of signal sequences are 
known to those of skill in the art. In one embodiment, the peptides are identified by a 

10 method referred to as SignalP. SignalP uses a combined neural network which 

recognizes both signal peptides and their cleavage sites. (Nielsen, H., Engelbrecht, J., 
Brunalk, S., von Heijne, G., **Identification of prokaiyotic and eukaryotic signal 
peptides and prediction of their cleavage sites." Protein Engineering, vol. 10, no. 1, p. 
1 -6 (1 997), hereby incorporated by reference.) It should be understood that some of 

1 S the amylases of the invention may not have signal sequences. It may be desirable to 
include a nucleic acid sequence encoding a signal sequence from one amylase 
operably linked to a nucleic acid sequence of a different amylase or, optionally, a 
signal sequence from a non-amylase protein may be desired. Table 3 shows signal 
seqeunes of the invention. 

20 The phrases '^nucleic acid" or '"nucleic acid sequence" as used herein refer to 

an oligonucleotide, nucleotide, polynucleotide, or to a fragment of any of these, to 
DNA or RNA of genomic or synthetic origin which may be single-stranded or double- 
stranded and may represent a sense or antisense strand, to peptide nucleic acid (PNA), 
or to any DNA-like or RNA-like material, natural or synthetic in origin. 

25 A "coding sequence of* or a "nucleotide sequence encoding" a particular 

polypeptide or protein, is a nucleic acid sequence which is transcribed and translated 
into a polypeptide or protein when placed under thie control of appropriate regulatory 
sequences. 

The term "gene" means the segment of DNA involved in producing a 
30 polypeptide chain; it includes regions preceding and following the coding region 
(leader and trailer) as well as, where applicable, intervening sequences (introns) 
between individual coding segments (exons). 
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5 "Amino acid*' or "amino acid sequence" as used herein refer to an 

oligopeptide, peptide, polypeptide, or protein sequence, or to a fragment, portion, or 
subunit of any of these, and to naturally occurring or synthetic molecules. 

The term "polypeptide" as used herein, refers to amino acids joined to each 
other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 

10 contain modified amino acids other than the 20 gene-encoded amino acids. The 
polypeptides may be modified by either natural processes, such as post-translational 
processing, or by chemical modification techniques which are well known in the art. 
Modifications can occur anywhere in the polypeptide, including the peptide backbone, 
the amino acid side-chains and the amino or carboxyl termini. It will be appreciated 

1 5 that the same type of modification may be present in the same or varying degrees at 
several sites in a given polypeptide. Also a given polypeptide may have many types 
of modifications. Modifications include acetylation, acylation, ADP-ribosylation, 
amidation, covalent attachment of flavin, covalent attachment of a heme moiety, 
covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a 

20 lipid or lipid derivative, covalent attachment of a phosphytidylinositol, cross-linking 
cyclization, disulfide bond formation, demethylation, formation of covalent cross- 
links, formation of cysteine, formation of pyroglutamate, formylation, gamma- 
caifooxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, 
methylation, myristolyation, oxidation, pergylation, proteolytic processing, 

25 phosphorylation, prenylation, racemization, selenoylation, sulfation, and transfer- 
RNA mediated addition of amino acids to protein such as arginylation. (See 
Creighton, T.E., Proteins - Structure and Molecular Properties 2nd Ed., W.H. 
Freeman and Company, New York (1993); Posttranslational Covalent Modification of 
Proteins, B.C. Johnson, Ed., Academic Press, New York, pp. 1-12 (1983)). 

30 As used herein, the term "isolated" means that the material is removed fi-om its 

original environment (e.g., the natural environment if it is naturally occurring). For 
example, a naturally-occurring polynucleotide or polypeptide present in a living 
animal is not isolated, but the same polynucleotide or polypeptide, separated fi-om 
some or all of the coexisting materials in the natural system, is isolated. Such 
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5 polynucleotides could be part of a vector and/or such polynucleotides or polypeptides 
could be part of a composition, and still be isolated in that such vector or composition 
is not part of its natural environment. 

As used herein, the term "purified" does not require absolute purity; rather, it 
is intended as a relative definition. Individual nucleic acids obtained from a library 

10 have been conventionally purified to electrophoretic homogeneity. The sequences 
obtained from these clones could not be obtained directly either fi-om the library or 
from total human DNA» The purified nucleic acids of the invention have been 
purified fi-om the remainder of the genomic DNA in the organism by at least 104-106 
fold. However, the term "purified" also includes nucleic acids which have been 

15 purified fi-om the remainder of the genomic DNA or firom other sequences in a library 
or oflier environment by at least one order of magnitude, typically two or three orders, 
and more typically four or five orders of magnitude. 

As used herein, the term '^recombinant" means that the nucleic acid is adjacent 
to a "backbone" nucleic acid to which it is not adjacent in its natural environment, 

20 Additionally, to be "enriched" the nucleic acids will represent 5% or more of the 
number of nucleic acid inserts in a population of nucleic acid backbone molecules. 
Backbone molecules according to the invention include nucleic acids such as 
expression vectors, self-replicating nucleic acids, viruses, integrating nucleic acids, 
and other vectors or nucleic acids used to maintain or manipulate a nucleic acid insert 

25 of interest. Typically, the enriched nucleic acids represent 15% or more of the 

number of nucleic acid inserts in the population of recombinant backbone molecules. 
More typically, the enriched nucleic acids represent 50% or more of the number of 
nucleic acid inserts in the population of recombinant backbone molecules. In a one 
embodiment, the enriched nucleic acids represent 90% or more of the number of 

30 nucleic acid inserts in the population of recombinant backbone molecules. 

"Recombinant" polypeptides or proteins refer to polypeptides or proteins 
produced by recombinant DNA techniques; i.e., produced fi-om cells transformed by 
an exogenous DNA construct encoding the desired polypeptide or protein. 
"Synthetic" polypeptides or protein are those prepared by chemical synthesis. Solid- 
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5 phase chemical peptide synthesis methods can also be used to synthesize the 

polypeptide or fragments of the invention. Such method have been known in the art 
since the early 1960's (Merrifield, R. B., J. Am. Chem. Soc, 85:2149-2154, 1963) 
(See also Stewart, J. M. and Young, J. D., Solid Phase Peptide Synthesis, 2nd Ed., 
Pierce Chemical Co., Rockford, 111., pp. 1 1-12)) and have recently been employed in 

1 0 commercially available laboratory pq)tide design and synthesis kits (Cambridge 

Research Biochemicals). Such commercially available laboratory kits have generally 
utilized the teachings of H. M. Geysen et al, Proc. Natl. Acad. Sci., USA, 81:3998 
(1984) and provide for synthesizing peptides upon the tips of a multitude of ""rods" or 
'"pins'* all of which are connected to a single plate. When such a system is utilized, a 

1 5 plate of rods or pins is inverted and inserted into a second plate of corresponding 
wells or reservoirs, which contain solutions for attaching or anchoring an appropriate 
amino acid to the pin's or rod*s tips. By repeating such a process step, i.e., inverting 
and inserting the rod's and pin's tips into appropriate solutions, amino acids are built 
into desired peptides. In addition, a number of available FMOC peptide synthesis 

20 systems are available. For example, assembly of a polypeptide or fragment can be 
carried out on a solid support using an Applied Biosystems, Inc. Model 43 1 A 
• automated peptide synthesizer. Such equipment provides ready access to the peptides 
of the invention, either by direct synthesis or by synthesis of a series of fragments that 
can be coupled using other known techniques. 

25 A promoter sequence is "operably linked to" a coding sequence when RNA 

polymerase which initiates transcription at the promoter will transcribe the coding 
sequence into mRNA. 

"Plasmids" are designated by a lower case **p" preceded and/or followed by 
capital letters and/or numbers. The starting plasmids herein are either commercially 
30 available, publicly available on an unrestricted basis, or can be constructed from 
available plasmids in accord with published procedures. In addition, equivalent 
plasmids to those described herein are known in the art and will be apparent to the 
ordinarily skilled artisan. 
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5 "Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 

enzyme that acts only at certain sequences in the DNA. The various restriction 
enzymes used herein arc commercially available and their reaction conditions, 
cofactors and other requirements were used as would be known to the ordinarily 
skilled artisan. For analytical purposes, typically 1 jig of plasmid or DNA fragment is 

10 used with about 2 units of enzyme in about 20 ^1 of buffer solution. For the purpose 
of isolating DNA fragments for plasmid construction, t3i)ically 5 to 50 \xg of DNA are 
digested with 20 to 250 units of enzyme in a larger volume. Appropriate buffers and 
substrate amounts for particular restriction enzymes are specified by the 
manufacturer. Incubation times of about 1 hour at 37X are ordinarily used, but may 

1 S vary in accordance with the supplier's instructions. After digestion, gel 
electrophoresis may be performed to isolate the desired fiagment. 

"Oligonucleotide" refers to either a single stranded polydeoxynucleotide or 
two complementary polydeoxynucleotide strands which may be chemically 
synthesized. Such synthetic oligonucleotides have no 5' phosphate and thus will not 
20 ligate to another oligonucleotide without adding a phosphate with an ATP m the 

presence of a kinase. A synthetic oligonucleotide will ligate to a fragment that has not 
been dephosphorylated. 

The phrase "substantially identical" in the context of two nucleic acids or 
polypeptides, refers to two or more sequences that have at least 50%, 60%, 70%, 

25 80%, and in some aspects 90-95% nucleotide or amino abid residue identity, when 
compared and aligned for maximum correspondence, as measured using one of the 
known sequence comparison algorithms or by visual inspection. Typically, the 
substantial identity exists over a region of at least about 100 residues, and most 
conmionly the sequences are substantially identical over at least about 150-200 

30 residues. In some embodiments, the sequences are substantially identical over the 
entire length of the coding regions. 

Additionally a "substantially identical" amino acid sequence is a sequence that 
differs from a reference sequence by one or more conservative or non-conservative 
amino acid substitutions, deletions, or insertions, particularly when such a substitution 
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5 occurs at a site that is not the active site of the molecule, and provided that the 
polypeptide essentially retains its functional properties. A conservative amino acid 
substitution, for example, substitutes one amino acid for another of the same class 
(e.g., substitution of one hydrophobic amino acid, such as isoleucin, valine, leucine, 
or methionine, for another, or substitution of one polar amino acid for another, such as 

10 substitution of arginine for lysine, glutamic acid for aspartic acid or glutamine for 
asparagine). One or more amino acids can be deleted, for example, from an alpha 
amylase polypeptide, resulting in modification of the structure of the polypeptide, 
without significantly altering its biological activity. For exanriple, amino- or caiboxyl- 
terminal amino acids that are not required for alpha amylase biological activity can be 

1 5 removed. Modified polypeptide sequences of the invention can be assayed for alpha 
amylase biological activity by any number of methods, including contacting the 
modified polypeptide sequence with an alpha amylase substrate and determining 
whether the modified polypeptide decreases the amount of specific substrate in the 
assay or increases the bioproducts of the enzymatic reaction of a functional alpha 

20 amylase polypeptide with the substrate. 

"Fragments** as used herein are a portion of a naturally occurring protein 
which can exist in at least two different conformations. Fragments can have the same 
or substantially the same amino acid sequence as the iiaturally occurring protein. 
"Substantially the same" means that an amino acid sequence is largely, but not 

25 entirely, the same, but retains at least one fimctional activity of the sequence to which 
it is related. In general two amino acid sequences are "substantially the same" or 
"substantially homologous" if they are at least about 85% identical. Fragments which 
have different three dimensional structures as the naturally occurring protein are also 
included. An example of this, is a **pro-fonn" molecule, such as a low activity 

30 proprotein that can be modified by cleavage to produce a mature enzyme with 
significantly higher activity. 



"Hybridization" refers to the process by which a nucleic acid strand joins with 
a complementary strand through base pairing. Hybridization reactions can be 
sensitive and selective so that a particular sequence of interest can be identified even 
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5 in samples in which it is present at low concentrations. Suitably stringent conditions 
can be defined by, for example, the concentrations of salt or formamide in the 
prehybridization and hybridization solutions, or by the hybridization temperature, and 
are well known in the art. In particular, stringency can be increased by reducing the 
concentration of salt, increasing the concentration of formamide, or raising the 
10 hybridization temperature. 

For example, hybridization under high stringency conditions could occur in 
about 50% formamide at about 37^C to 42''C. Hybridization could occur under 
reduced stringency conditions in about 35% to 25% formamide at about 30°C to 
35®C. In particular, hybridization could occur under high stringency conditions at 

15 42^C in 50% formamide, 5X SSPE, 0.3% SDS, and 200 n/ml sheared and denatured 
salmon sperm DNA. Hybridization could occur under reduced stringency conditions 
as described above, but in 35% formamide at a reduced temperature of 35°C. The 
temperature range corresponding to a particular level of stringency can be further 
narrowed by calculating the purine to pyrimidine ratio of the nucleic acid of interest 

20 and adjusting the temperature accordingly. Variations on the above ranges and 
conditions are well known in the art. 

The term "variant" refers to polynucleotides or polypeptides of the invention 
modified at one or more base pairs, codons, introns, exons, or amino acid residues 
(respectively) yet still retain the biological activity of an alpha amylase of the 

25 invention. Variants can be produced by any number of means included methods such 
as, for example, error-prone PCR, shuffling, oligonucleotide-directed mutagenesis, 
assembly PCR, sexual PCR mutagenesis, in vivo mutagenesis, cassette mutagenesis, 
recursive ensemble mutagenesis, exponential ensemble mutagenesis, site-specific 
mutagenesis, gene reassembly, GSSM and any combination thereof Techniques for 

30 producing variant amylases having activity at a pH or temperature, for example, that 
is different from the wild-type amylase, are included herein. 



Enzymes are highly selective catalysts. Their hallmark is the ability to catalyze 
reactions with exquisite stereo-, regio-, and chemo- selectivities that are unparalleled 
in conventional synthetic chemistry. Moreover, enzymes are remarkably versatile. 
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5 They can be tailored to function in organic solvents, operate at extreme pHs (for 
example, high pHs and low pHs) extreme temperatures (for example, high 
temperatures and low temperatures), extreme salinity levels (for example, high 
salinity and low salinity), and catalyze reactions with compounds that are structurally 
unrelated to their natural, physiological substrates. 

10 Enzymes are reactive toward a wide range of natural and unnatural substrates, 

thus enabling the modification of virtually any organic lead compound. Moreover, 
unlike traditional chemical catalysts, enzymes are highly enantio- and regio-selective. 
The high degree of functional group specificity exhibited by enzymes enables one to 
keep track of each reaction in a synthetic sequence leading to a new active compound. 

IS Enzymes are also capable of catalyzing many diverse reactions unrelated to their 
physiological function in nature. For example, peroxidases catalyze the oxidation of 
phenols by hydrogen peroxide. Peroxidases can also catalyze hydroxylation reactions 
that are not related to the native function of the enzyme. Other examples are proteases 
which catalyze the breakdown of polypeptides. In organic solution some proteases can 

20 also acylate sugars, a function unrelated to the native function of these enzymes. 

In one aspect, the invention includes a method for liquefying a starch 
containing composition comprising contacting the starch with a polypeptide of the 
invention (e.g., a purified polypeptide selected from polypeptides having an amino 
acid sequence selected from the group consisting of: Group B amino acid sequences; 

25 variants having at least about 50% homology to at least one of Group B amino acid 
sequences, over a region of at least about 100 residues, as determined by analysis with 
a sequence comparison algorithm or by visual inspection; sequences complementary 
to any one of Group B amino acid sequences; and sequences complementary to 
variants having at least about 50% homology to any one of Group B amino acid 

30 sequences over a region of at least about 1 00 residues, as determined by analysis with 
a sequence comparison algorithm or by visual inspection; and polypeptides having at 
least 10 consecutive amino acids of a polypeptide having a sequence selected from the 
group consisting of Group B amino acid sequences). In one preferred embodiment, 
the polypeptide is set forth in Group B amino acid sequences. The starch may be from 
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5 a material selected from rice, germinated rice, com, barley, wheat, legumes and sweet 
potato. A glucose syrup produced by the method of the invention is included herein. 
Such a syrup can be a maltose syrup, a glucose syrup, or a combination thereof. In 
particular, the syrups produced using the amylases of the invention there is a higher 
level of DP2 fraction and a higher level of DP3 (maltotriose and/or panose) and less 
10 of the greater than DP7 fragments as compared to the syrups produced by commercial 
enzymes. This is consistent with the liquefaction profile since less of the large 
fragments are in the invention liquefied syrups. 

The invention also provides a method for removing starch containing stains 
from a material comprising contacting the material with a polypeptide of the 

15 invention. In one aspect, the invention provides a method for washing an object 

comprising contacting the object with a polypeptide of the invention under conditions 
sufficient for washing. A polypeptide of the invention may be included as a detergent 
additive for example. The invention also includes a method for textile desizing 
comprising contacting the textile with a polypeptide of the invention under conditions 

20 sufficient for desizing. 

The invention also provides a method of reducing the staling of bakery 
products comprising addition of a polypeptide of the invention to the bakery product, 
prior to baking. 

The invention also provides a method for the treatment of lignocellulosic 
25 fibers, wherein the fibers are treated with a polypeptide of the invention, in an amount 
which is efficient for improving the fiber properties. The invention includes a for 
enzymatic deinking of recycled paper pulp, wherein the polypeptide is applied in an 
amount which is efficient for effective deinking of the fiber surface. 

Any of the methods described herein include the possibility of the addition of 
30 a second alpha amylase or a beta amylase or a combination thereof Commercial 
amylases or other enzymes suitable for use in combination with an enzyme of the 
invention are known to those of skill in the art. 
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5 The invention also includes a method of increasing the flow of production 

fluids from a subterranean formation by removing a viscous, starch-containing, 
damaging fluid formed during production operations and found within the 
subterranean formation which surrounds a completed well bore comprising allowing 
production fluids to flow from the well bore; reducing the flow of production fluids 

10 from the formation below expected flow rates; formulating an enzyme treatment by 
blending together an aqueous fluid and a polypeptide of the invention; pumpmg the 
enzyme treatment to a desired location within the well bore; allowing the enzyme 
treatment to degrade the viscous, starch-containing, damaging fluid, whereby the fluid 
can be removed from the subterranean fomiation to the well surface; and wherein the 

1 5 enzyme treatment is effective to attack the alpha glucosidic linkages in the starch- 
containing fluid. 

The present invention exploits the unique catalytic properties of enzymes. 
Whereas the use of biocatalysts (i.e., purified or crude enzymes, non-living or living 
cells) in chemical transformations normally requires the identification of a particular 
20 biocatalyst that reacts with a specific starting compound, the present invention uses 
selected biocatalysts and reaction conditions that are specific for functional groups 
that are present in many starting compounds. 

Each biocatalyst is specific for one functional group, or several related 
functional groups, and can react with many starting compounds containing this 
25 functional group. 

The biocatalytic reactions produce a population of derivatives from a single 
starting compound. These derivatives can be subjected to another round of 
biocatalytic reactions to produce a second population of derivative compounds. 
Thousands of variations of the original compound can be produced with each iteration 
30 of biocatalytic derivatization. 

Enzymes react at specific sites of a starting compound without affecting the 
rest of the molecule, a process which is very difficult to achieve using traditional 
chemical methods. This high degree of biocatalytic specificity provides the means to 
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5 identify a single active compound within the Hbrary. The library is characterized by 
the series of biocatalytic reactions used to produce it, a so-called "biosynthetic 
history". Screening the library for biological activities and tracing the biosynthetic 
history identifies the specific reaction sequence producing the active compound. The 
reaction sequence is repeated and the structure of the synthesized compound 

10 detennined. This mode of identification, unlike other synthesis and screening 
approaches, does not require immobilization technologies, and compounds can be 
synthesized and tested free in solution using virtually any type of screening assay. It 
is important to note, that the high degree of specificity of enzyme reactions on 
functional groups allows for the "tracking" of specific enzymatic reactions that make 

IS up the biocatalytically produced library. 

There are many advantages to screening lambda phage libraries for 
expression-based discovery of amylases. These include improved detection of toxic 
clones; improved access to substrate; reduced need for engineering a host; by-passing 
the potential for any bias resulting from mass excision of the library; and faster 
20 growth at low clone densities. Additionally, there are advantages to screening lambda 
phage libraries in liquid phase over solid phase. These include: greater flexibility in 
assay conditions; additional substrate flexibility; higher sensitivity for weak clones; 
and ease of automation. 

Many of the procedural steps are performed using robotic automation enabling 
25 the execution of many thousands of biocatalytic reactions and screening assays per 
day as well as ensuring a high level of accuracy and reproducibility. As a result, a 
library of derivative compounds can be produced in a matter of weeks which would 
take years to produce using current chemical methods. (For further teachings on 
modification of molecules, including small molecules, see PCT/US94/09174, herein 
30 incoiporated by reference in its entirety). 

In one aspect, the present invention provides a non-stochastic method temied 
synthetic gene reassembly, that is somewhat related to stochastic shuffling, save that 
the nucleic acid building blocks are not shuffled or concatenated or chimerized 
randomly, but rather are assembled non-stochastically. 
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5 The synthetic gene reassembly method does not depend on the presence of a 

high level of homology between polynucleotides to be shuffled. The invention can be 
used to non-stochastically generate libraries (or sets) of progeny molecules comprised 
of over 10*^ different chimeras. Conceivably, synthetic gene reassembly can even be 
used to generate libraries comprised of over 10^**^ different progeny chimeras. Thus, 

10 in one aspect, the invention provides a non-stochastic method of producing a set of 
finalized chimeric nucleic acid molecules having an overall assembly order that is 
chosen by design, which method is comprised of the steps of generating by design a 
plurality of specific nucleic acid building blocks having serviceable mutually 
. compatible ligatable ends, and assembling these nucleic acid building blocks, such 

1 S that a designed overall assembly order is achieved. 

The mutually compatible ligatable ends of the nucleic acid building blocks to 
be assembled are considered to be "serviceable" for this type of ordered assembly if 
they enable the building blocks to be coupled in predetermined orders. Thus, in one 
aspect, the overall assembly order in which the nucleic acid building blocks can be 

20 coupled is specified by the design of the ligatable ends and, if more than one assembly 
step is to be used, then the overall assembly order in which the nucleic acid building 
blocks can be coupled is also specified by the sequential order of the assembly step(s). 
In a one embodiment of the invention, the annealed building pieces are treated with an 
enzyme, such as a ligase (e.g., T4 DNA ligase) to achieve covalent bonding of the 

25 buildmg pieces. 

In a another embodiment, the design of nucleic acid building blocks is 
obtained upon analysis of the sequences of a set of progenitor nucleic acid templates 
that serve as a basis for producing a progeny set of finalized chimeric nucleic acid 
molecules. These progenitor nucleic acid templates thus serve as a source of 
30 sequence information that aids in the design of the nucleic acid building blocks that 
are to be mutagenized, e.g., chimerized or shuffled. 

In one exemplification, the invention provides for the chimerization of a 
family of related genes and their encoded family of related products. In a particular 
exemplification, the encoded products are enzymes. The amylases of the present 



wo 02/068589 



PCTAJS02/05068 



26 

5 invention, for example, alpha amylases or alkaline amylases, can be mutagenized in 
accordance with the methods described herein. 

Thus according to one aspect of the invention, the sequences of a plurality of 
progenitor nucleic acid templates (e.g., polynucleotides of Group A nucleic acid 
sequences) are aligned in order to select one or more demarcation points, which 
10 demarcation points can be located at an area of homology. The demarcation points 
can be used to delineate the boundaries of nucleic acid building blocks to be 
generated. Thus, the demarcation points identified and selected in the progenitor 
molecules serve as potential chimerization points in the assembly of the progeny 
molecules. 

15 Typically a serviceable demarcation point is an area of homology (comprised 

of at least one homologous nucleotide base) shared by at least two progenitor 
templates, but the demarcation point can be an area of homology that is shared by at 
least half of the progenitor templates, at least two thirds of the progenitor templates, at 
least three fourths of the progenitor templates, and preferably at almost all of the 

20 progenitor templates. Even more preferably still a serviceable demarcation point is an 
area of homology that is shared by all of the progenitor templates. 

In a one embodiment, the gene reassembly process is performed exhaustively 
in order to generate an exhaustive library. In other words, all possible ordered 
combinations of the nucleic acid building blocks are represented in the set of finalized 
25 chimeric nucleic acid molecules. At the same time, the assembly order (i.e. the order 
of assembly of each building block in the 5' to 3 sequence of each finalized chimeric 
nucleic acid) in each combination is by design (or non-stochastic). Because of the 
non-stochastic nature of the method, the possibility of unwanted side products is 
greatly reduced. 

30 In another embodiment, the method provides that the gene reassembly process 

is performed systematically, for example to generate a systematically 
compartmentalized library, with compartments that can be screened systematically, 
e.g., one by one. In other words the invention provides that, through the selective and 
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5 judicious use of specific nucleic acid building blocks, coupled with the selective and 
judicious use of sequentially stepped assembly reactions, an experimental design can 
be achieved where specific sets of progeny products are made in each of several 
reaction vessels. This allows a systematic examination and screening procedure to be 
performed. Thus, it allows a potentially very large number of progeny molecules to 
1 0 be examined systematically in smaller groups. 

Because of its ability to perform chimerizations in a manner that is highly 
flexible yet exhaustive and systematic as well, particularly when there is a low level 
of homology among the progenitor molecules, the instant invention provides for the 
generation of a library (or set) comprised of a large number of progeny molecules. 
1 5 Because of the non-stochastic nature of the instant gene reassembly invention, the 
progeny molecules generated preferably comprise a library of finalized chimeric 
nucleic acid molecules having an overall assembly order that is chosen by design. In 
a particularly embodiment, such a generated library is comprised of greater than 103 
to greater than 101000 different progeny molecular species. 

20 In one aspect, a set of finalized chimeric nucleic acid molecules, produced as 

described is comprised of a polynucleotide encoding a polypeptide. According to one 
embodiment, this polynucleotide is a gene, which may be a man-made gene. 
According to another embodiment, this polynucleotide is a gene pathway, which may 
be a man-made gene pathway. The invention provides that one or more man-made 

25 genes generated by the invention may be incorporated into a man-made gene 
pathway, such as pathway operable in a eukaryotic organism (including a plant). 

In another exemplification, the synthetic nature of the step in which the 
building blocks are generated allows the design and introduction of nucleotides (e.g., 
one or more nucleotides, which may be, for example, codons or introns or regulatory 
30 sequences) that can later be optionally removed in an in vitro process (e.g., by 

mutagenesis) or in an in vivo process (e.g., by utilizing the gene splicing ability of a 
host organism). It is appreciated that in many instances the introduction of these 
nucleotides may also be desirable for many other reasons in addition to the potential 
benefit of creating a serviceable demarcation point. 
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5 Thus, according to another embodiment, the invention provides that a nucleic 

acid building block can be used to introduce an intron. Thus, the invention provides 
that functional introns may be introduced into a man-made gene of the invention. The 
invention also provides that functional introns may be introduced into a man-made 
gene pathway of the invention. Accordingly, the invention provides for the 
10 generation of a chimeric polynucleotide that is a man-made gene containing one (or 
more) artificially introduced intion(s). 

Accordingly, the invention also provides for the generation of a chimeric 
polynucleotide that is a man-made gene pathway containing one (or more) artificially 
introduced intron(s). Preferably, the artificially introduced intron(s) are functional in 
1 5 one or more host cells for gene splicing much in the way that naturally-occurring 
introns serve functionally in gene splicing. The invention provides a process of 
producing man-made intron-containing polynucleotides to be introduced into host 
organisms for recombination and/or splicing. 

A man-made gene produced using the invention can also serve as a substrate 
20 for recombination with another nucleic acid. Likewise, a man-made gene pathway 
produced using the invention can also serve as a substrate for recombination with 
another nucleic acid. In a preferred instance, the recombination is facilitated by, or 
occurs at, areas of homology between the man-made, intron-containing gene and a 
nucleic acid, which serves as a recombination partner. In a particularly preferred 
25 instance, the recombination partner may also be a nucleic acid generated by the 

invention, including a man-made gene or a man-made gene pathway. Recombination 
may be facilitated by or may occur at areas of homology that exist at the one (or 
more) artificially introduced intron(s) in the man-made gene. 

The synthetic gene reassembly method of the invention utilizes a plurality of 
30 nucleic acid building blocks, each of which preferably has two ligatable ends. The 
two ligatable ends on each nucleic acid building block may be two blunt ends (i.e. 
each having an overhang of zero nucleotides), or preferably one blunt end and one 
overhang, or more preferably still two overhangs. 
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5 A useful overhang for this purpose may be a 3' overhang or a 5' overhang. 

Thus, a nucleic acid building block may have a 3 ' overhang or alternatively a 5 ' 
overhang or alternatively two 3' overhangs or alternatively two 5* overhangs. The 
overall order in which the nucleic acid building blocks are assembled to form a 
finalized chimeric nucleic acid molecule is determined by purposeful experimental 
1 0 design and is not random. 

According to one preferred embodiment, a nucleic acid building block is 
generated by chemical synthesis of two single-stranded nucleic acids (also referred to 
as single-stranded oligos) and contacting them so as to allow them to anneal to form a 
double-stranded nucleic acid building block. 

1 5 A double*stranded nucleic acid building block can be of variable size. The 

sizes of these building blocks can be small or large. Preferred sizes for building block 
range from 1 base pair (not including any overhangs) to 100,000 base pairs (not 
including any overhangs). Other preferred size ranges are also provided, which have 
lower limits of fix)m 1 bp to 10,000 bp (including every integer value in between), and 

20 upper limits of from 2 bp to 100, 000 bp (mcluding every integer value in between). 

Many methods exist by which a double-stranded nucleic acid building block 
can be generated that is serviceable for the invention; and these are known in the art 
and can be readily performed by the skilled artisan. 

Accordmg to one embodiment, a double-stranded nucleic acid building block 
25 is generated by first generating two single stranded nucleic acids and allowing them to 
anneal to form a double-stranded nucleic acid building block. The two strands of a 
double-stranded nucleic acid building block may be complementary at every 
nucleotide apart from any that form an overhang; thus containing no mismatches, 
apart &om any overhang(s). According to another embodiment, the two strands of a 
30 double-stranded nucleic acid building block are complementary at fewer than every 
nucleotide apart from any that form an overhang. Thus, according to thSs 
embodiment, a double-stranded nucleic acid building block can be used to introduce 
codon degeneracy. Preferably the codon degeneracy is introduced using the site- 
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5 saturation mutagenesis described herein, using one or more N,N,GAr cassettes or 
alternatively using one or more NJ4,N cassettes. 

The in vivo recombination method of the invention can be performed blindly 
on a pool of unknown hybrids or alleles of a specific polynucleotide or sequence. 
However, it is not necessary to know the actual DNA or RNA sequence of the specific 
10 polynucleotide. 

The approach of using recombination within a mixed population of genes can 
be useful for the generation of any useful proteins, for example, interleukin I, 
antibodies, tPA and growth hormone. This approach may be used to generate proteins 
having altered specificity or activity. The approach may also be useful for the 
1 5 generation of hybrid nucleic acid sequences, for example, promoter regions, introns, 
exons, enhancer sequences, 31 untranslated regions or 51 untranslated regions of 
genes. Thus this approach may be used to generate genes having increased rates of 
expression. This approach may also be useful in the study of repetitive DNA 
sequences. Finally, this approach may be useful to mutate ribozymes or aptamers. 

20 In one aspect the invention described herein is directed to the use of repeated 

cycles of reductive reassortment, recombination and selection which allow for the 
directed molecular evolution of highly complex linear sequences, such as DNA, RNA 
or proteins thorough recombination. 

In vivo shuffling of molecules is useful in providing variants and can be 
25 performed utilizing the natural property of cells to recombine multimers. While 
recombination in vivo has provided the major natural route to molecular diversity, 
genetic recombination remains a relatively complex process that involves 1) the 
recognition of homologies; 2) strand cleavage, strand invasion, and metabolic steps 
leading to the production of recombinant chiasma; and finally 3) the resolution of 
30 chiasma into discrete recombined molecules. The formation of the chiasma requires 
the recognition of homologous sequences. 

In another embodiment, the invention includes a method for producing a 
hybrid polynucleotide fi-om at least a first polynucleotide and a second 
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5 polynucleotide. The invention can be used to produce a hybrid polynucleotide by 
introducing at least a first polynucleotide and a second polynucleotide which share at 
least one region of partial sequence homology into a suitable host cell. The regions of 
partial sequence homology promote processes which resuU in sequence reorganization 
producing a hybrid polynucleotide. The term "hybrid polynucleotide", as used herein, 

10 is any nucleotide sequence which results from the method of the present invention and 
contains sequence from at least two original polynucleotide sequences. Such hybrid 
polynucleotides can result from intermolecular recombination events which promote 
sequence integration between DNA molecules. In addition, such hybrid 
polynucleotides can result from intramolecular reductive reassortment processes 

1 5 which utilize repeated sequences to alter a nucleotide sequence within a DNA 
molecule. 

The invention provides a means for generating hybrid polynucleotides which 
may encode biologically active hybrid polypeptides (e.g., hybrid alpha amylases). In 
one aspect, the original polynucleotides encode biologically active polypeptides. The 

20 method of the invention produces new hybrid polypeptides by utilizing cellular 

processes which integrate the sequence of the original polynucleotides such that the 
resulting hybrid polynucleotide encodes a polypeptide demonstrating activities 
derived from the original biologically active polypeptides. For example, the original 
polynucleotides may encode a particular enzyme from different microorganisms. An 

25 enzyme encoded by a first polynucleotide from one organism or variant may, for 
example, function effectively under a particular environmental condition, e.g. high 
salinity. An enzyme encoded by a second polynucleotide from a different organism 
or variant may function effectively under a different environmental condition, such as 
extremely high temperatures. A hybrid polynucleotide containing sequences from the 

30 first and second original polynucleotides may encode an enzyme which exhibits 
characteristics of both enzymes encoded by the original polynucleotides. Thus, the 
enzyme encoded by the hybrid polynucleotide may function effectively under 
environmental conditions shared by each of the enzymes encoded by the first and 
second polynucleotides, e.g., high salinity and extreme temperatures. 
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5 Enzymes encoded by the polynucleotides of the invention include, but are not 

limited to, hydrolases, such as alpha amylases and alkaline amylases. A hybrid 
polypeptide resulting from the method of the invention may exhibit specialized 
enzyme activity not displayed in the original enzymes. For example, following 
recombination and/or reductive reassortment of polynucleotides encoding hydrolase 

10 activities, the resulting hybrid polypeptide encoded by a hybrid polynucleotide can be 
screened for specialized hydrolase activities obtained from each of flie original 
enzymes, i.e. tiie type of bond on which tiie hydrolase acts and die tempenrture at 
which the hydrolase fimctions. Thus, for example, tiie hydrolase may be screened to 
ascertain tiiose chemical functionalities which distinguish tiie hybrid hydrolase from 

15 tiie original hydrolases, such as: (a) amide (peptide bonds), i.e., proteases; (b) ester 
bonds, i.e., amylases and lipases; (c) acetals, i.e., glycosidases and, for example, the 
temperature, pH or salt concenti:ation at which tiie hybrid polypeptide functions. 

Sources of tfie original polynucleotides may be isolated from individual 
organisms ("isolates"), collections of organisms ttiat have been grown in defined 
20 media ("enrichment cultures"), or, uncultivated organisms ("environmental samples"). 
The use of a culture-independent ^proach to derive polynucleotides encoding novel 
bioactivities &om environmental samples is most preferable since it allows one to 
access untapped resources of biodiversity. 

"Environmental libraries" are generated from environmental samples and 
25 represent the collective genomes of naturally occurring organisms archived in cloning 
vectors tiiat can be propagated in suitable prokaryotic hosts. Because tiie cloned DNA 
is initially extracted directly from environmental samples, tfie libraries are not limited 
to the small fiction of prokaryotes tiiat can be grown in pure culture. Additionally, a 
nomialization of tiie environmental DNA present in tiiese samples could allow more 
30 equal representation of ttie DNA from all of tiie species present in the original sample. 
This can dramatically increase flie efficiency of finding interesting genes from minor 
constihients of tiie sample which may be under-represented by several orders of 
magnitude compared to the dominant species. 
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5 For exanq)le, gene libraries generated from one or more uncultivated 

microorganisms arc screened for an activity of interest. Potential pathways encoding 
bioactive molecules of interest are first captured in prokaryotic cells in the form of 
gene expression libraries. Polynucleotides encoding activities of interest are isolated 
from such libraries and introduced into a host cell. The host cell is grown under 
1 0 conditions which promote recombination and/or reductive reassortment creating 
potentially active biomolecules with novel or enhanced activities. 

The microorganisms from which the polynucleotide may be prepared include 
prokaryotic microorganisms, such as Eubacteria and Archaebacteria, and lower 
eukaryotic microorganisms such as ftmgi, some algae and protozoa. Polynucleotides 

1 5 may be isolated from environmental samples in which case the nucleic acid may be 
recovered without culturing of an organism or recovered from one or more cultured 
organisms. In one aspect, such microorganisms may be extremophiles, such as 
hyperthermophiles, psychrophiles, psychrotrophs, halophiles, barophiles and 
acidophiles. Polynucleotides encoding enzymes isolated from extremophilic 

20 microorganisms are particularly preferred. Such enzymes may ftmction at 

temperatures above lOOX in terrestrial hot springs and deep sea thermal vents, at 
temperatures below 0**C in arctic waters, in the saturated salt environment of the Dead 
Sea, at pH values around 0 in coal deposits and geothermal sulfiir-rich springs, or at 
pH values greater than 1 1 in sewage sludge. For exan^le, several amylases and 

25 lipases cloned and expressed from extremophilic organisms show high activity 
throughout a wide range of temperatures and pHs. 

Of the novel enzymes of the present invention, many have been purified and 
characterized at pH 8, at both 40**C and 50*C, and pH 10 at both 40''C and 50**C. Of 
the enzymes found to be purified and characterized at pH 8 and 40^C, was seen to 
30 have three times (682 U/mg) the specific activity of a B. lichenoformis enzyme (228 
U/mg). Additionally, another enzyme was seen to have approximately equivalent 
activity (250U/mg) to the B. lichenoformis enzyme. At a pH 10 and 50T, one of the 
enzymes has a specific activity of 31U/mg and another has a specific activity of 27.5 
U/mg, while B. lichenoformis has a specific activity of 27 U/mg. 
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5 Polynucleotides selected and isolated as hereinabove described are introduced 

into a suitable host ceil. A suitable host cell is any cell which is enable of promoting 
recombination and/or reductive reassortment The selected polynucleotides are 
preferably already in a vector which includes appropriate control sequences. The host 
cell can be a higher cukaryotic cell, such as a mammalian cell, or a lower eukaryotic 
10 cell, such as a yeast cell, or preferably, the host cell can be a prokaryotic cell, such as 
a bacterial cell. Introduction of the construct into the host cell can be effected by 
calcium phosphate transfection, DEAE-Dextran mediated transfection, or 
electroporation (Davis et al, 1 986). 

As representative examples of appropriate hosts, there may be mentioned: 
15 bacterial cells, such as E, coli, Streptomyces, Salmonella typhimurium\ fungal cells, 
such as yeast; insect cells such as Drosophila S2 and Spodoptera Sf9; animal cells 
such as CHO, COS or Bowes melanoma; adenoviruses; and plant cells. The selection 
of an appropriate host is deemed to be within the scope of those skilled in the art from 
the teachings herein. 

20 With particular references to various mammalian cell culture systems that can 

be employed to express recombinant protein, examples of manunalian expression 
systems include the COS-7 lines of monkey kidney fibroblasts, described in "SV40- 
transformed simian cells support the rq)lication of early SV40 mutants*' (Gluzman, 
1981), and other cell lines capable of expressing a compatible vector, for example, the 

25 C127, 3T3, CHO, HeLa and BHK cell lines. Mammalian expression vectors will 
comprise an origin of replication, a suitable promoter and enhancer, and also any 
necessary ribosome binding sites, polyadenylation site, splice donor and acceptor 
sites, transcriptional termination sequences, and 5* flanking nontranscribed sequences. 
DNA sequences derived from the SV40 splice, and polyadenylation sites may be used 

30 to provide the required nontranscribed genetic elements. 

Host cells containing the polynucleotides of interest can be cultured in 
conventional nutrient media modified as appropriate for activating promoters, 
selecting transformants or amplifying genes. The culture conditions, such as 
temperature, pH and the like, are those previously used with the host cell selected for 
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5 expression, and will be apparent to the ordinarily skilled artisan. The clones which 
are identified as having the specified enzyme activity may then be sequenced to 
identify the polynucleotide sequence encoding an enzyme having the enhanced 
activity. 

In another aspect, it is envisioned the method of the present invention can be 

10 used to generate novel polynucleotides encoding biochemical pathways from one or 
more operons or gene clusters or portions thereof. For example, bacteria and many 
eukaryotes have a coordinated mechanism for regulating genes whose products are 
involved in related processes. The genes are clustered, in structures referred to as 
"gene clusters," on a single chromosome and are transcribed together under the 

1 5 control of a single regulatory sequence, including a single promoter which initiates 
transcription of the entire cluster. Thus, a gene cluster is a group of adjacent genes 
that are either identical or related, usually as to their function. An example of a 
biochemical pathway encoded by gene clusters are polyketides. Polyketides are 
molecules which are an extremely rich source of bioactivities, including antibiotics 

20 (such as tetracyclines and erythromycin), anti-cancer agents (daunomycin), 

immunosuppressants (FK506 and rapamycin), and veterinary products (monensin). 
Many pol^etides (produced by polyketide synthases) are valuable as therapeutic 
agents. Polyketide synthases are multifunctional enzymes that catalyze the 
biosynthesis of an enormous variety of carbon chains differing in length and patterns 

25 of functionality and cyclization. Pol>icetide synthase genes fall into gene clusters and 
at least one type (designated type I) of polyketide synthases have large size genes and 
enzymes, complicating genetic manipulation and in vitro studies of these 
genes/proteins. 

Gene cluster DNA can be isolated from different organisms and ligated into 
30 vectors, particularly vectors containing expression regulatory sequences which can 
control and regulate the production of a detectable protein or protein-related array 
activity from the ligated gene clusters. Use of vectors which have an exceptionally 
large capacity for exogenous DNA introduction are particularly appropriate for use 
with such gene clusters and are described by way of example herein to include the f- 
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5 factor (or fertility factor) of E. colL This f-factor of £. coli is a plasmid which affect 
high-frequency transfer of itself during conjugation and is ideal to achieve and stably 
propagate large DNA fragments, such as gene clusters from mixed microbial samples. 
A particularly preferred embodiment is to use cloning vectors, referred to as 
"fosmids" or bacterial artificial chromosome (BAC) vectors. These are derived fix)m 

10 £. coli f-factor which is able to stably integrate large segments of genomic DNA. 
When integrated with DNA from a mixed uncultured environmental sample, this 
makes it possible to achieve large genomic fragments in the form of a stable 
"environmental DNA library." Another type of vector for use in the present invention 
is a cosmid vector. Cosmid vectors were originally designed to clone and propagate 

1 5 large segments of genomic DNA. Cloning into cosmid vectors is described in detail 
in Sambrook et al.. Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring 
Harbor Laboratory Press (1989). Once ligated into an impropriate vector, two or more 
vectors containing different polyketide synthase gene clusters can be introduced into a 
suitable host cell. Regions of partial sequence homology shared by the gene clusters 

20 will promote processes which result in sequence reorganization resulting in a hybrid 
gene cluster. The novel hybrid gene cluster can then be screened for enhanced 
activities not found in the original gene clusters. 

Therefore, in a one embodiment, the invention relates to a method for 
producing a biologically active hybrid polypeptide and screening such a polypeptide 
25 for enhanced activity by: 

1) introducing at least a first polynucleotide in operable linkage and a 
second polynucleotide in operable linkage, said at least first polynucleotide and 
second polynucleotide sharing at least one region of partial sequence homology, into a 
suitable host cell; 

30 2) growing the host cell under conditions which promote sequence 

reorganization resulting in a hybrid polynucleotide in operable linkage; 

3) expressing a hybrid polypeptide encoded by the hybrid polynucleotide; 
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5 4) screening the hybrid polypeptide under conditions which promote 

identification of enhanced biological activit)r, and 

5) isolating the a polynucleotide encoding the hybrid polypeptide. 

Methods for screening for various enzyme activities are known to those of 
skill in the art and are discussed throughout the present specification. Such methods 
10 may be employed when isolating the polypeptides and polynucleotides of the 
invention. 

As representative examples of expression vectors which may be used, there 
may be mentioned viral particles, baculovirus, phage, plasmids, phagemids, cosmids, 
fosmids, bacterial artificial chromosomes, viral DNA (e.g., vaccinia, adenovirus, foul 

1 5 pox virus, pseudorabies and derivatives of SV40), PI -based artificial chromosomes, 
yeast plasmids, yeast artificial chromosomes, and any other vectors specific for 
specific hosts of interest (such as bacillus, aspergillus and yeast). Thus, for example, 
the DNA may be included in any one of a variety of expression vectors for expressing 
a polypeptide. Such vectors include chromosomal, nonchromosomal and synthetic 

20 DNA sequences. Large numbers of suitable vectors are known to those of skill in the 
art, and are conunercially available. The following vectors are provided by way of 
example; Bacterial: pQE vectors (Qiagen), pBluescript plasmids, pNH vectors, 
(lambda-ZAP vectors (Stratagene); ptrc99a, pKK223-3, pDR540, pRIT2T 
(Pharmacia); Eukaryotic: pXTl, pSG5 (Stratagene), pSVK3, pBPV, pMSG, 

25 pSVLSV40 (Pharmacia). However, any other plasmid or other vector may be used so 
long as they are replicable and viable in the host. Low copy number or high copy 
number vectors may be employed with the present invention. 

The DNA sequence in the expression vector is operatively linked to an 
appropriate expression control sequence(s) (promoter) to direct RNA synthesis. 
30 Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, lambda PR, PL 
and trp. Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, 
early and late SV40, LTRs from retrovirus, and mouse metallothionein-L Selection of 
the appropriate vector and promoter is well within the level of ordinary skill in the art. 
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5 The expression vector also contains a ribosome binding site for translation initiation 
and a transcription terminator. The vector may also include appropriate sequences for 
amplifying expression. Promoter regions can be selected from any desired gene using 
chloramphenicol transferase (CAT) vectors or other vectors with selectable markers. 
In addition, the expression vectors preferably contain one or more selectable marker 
10 genes to provide a phenotypic trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. colL 

In vivo reassortment is focused on "inter-molecular" processes collectively 
referred to as 'Recombination" which in bacteria, is generally viewed as a "RecA- 
1 5 dependent" phenomenon. The invention can rely on recombination processes of a 
host cell to recombine and re-assort sequences, or the cells* ability to mediate 
reductive processes to decrease the complexity of quasi-repeated sequences in the cell 
by deletion. This process of "reductive reassortment" occurs by an "intra-molecular", 
RecA-independent process. 

20 Therefore, in another aspect of the invention, novel polynucleotides can be 

generated by the process of reductive reassortment. The method involves the 
generation of constructs containing consecutive sequences (original encoding 
sequences), their insertion into an appropriate vector, and their subsequent 
introduction into an appropriate host cell. The reassortment of the individual 

25 molecular identities occurs by combinatorial processes between the consecutive 

sequences in the construct possessing regions of homology, or between quasi-repeated 
units. The reassortment process recombines and/or reduces the complexity and extent 
of the repeated sequences, and results in the production of novel molecular species. 
Various treatments may be applied to enhance the rate of reassortment. These could 

30 include treatment with ultra-violet light, or DNA damaging chemicals, and/or the use 
of host cell lines displaying enhanced levels of "genetic instability". Thus the 
reassortment process may involve homologous recombination or the natural property 
of quasi-repeated sequences to direct their own evolution. 
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5 Repeated or "quasi-repeated" sequences play a role in genetic instability. In 

the present invention, "quasi-repeats'* are repeats that are not restricted to their 
original unit structure. Quasi-repeated imits can be presented as an array of sequences 
in a construct; consecutive units of similar sequences. Once ligated, the junctions 
between the consecutive sequences become essentially invisible and the quasi- 

1 0 repetitive nature of the resulting construct is now continuous at the molecular level. 
The deletion process the cell performs to reduce the complexity of the resulting 
construct operates between the quasi-repeated sequences. The quasi-repeated units 
provide a practically limitless repertoire of templates upon which slippage events can 
occur. The constructs containing the quasi-rq>eats thus effectively provide sufficient 

15 molecular elasticity that deletion (and potentially insertion) events caii occur virtually 
anywhere within the quasi-repetitive units. 

When the quasi-repeated sequences are all ligated in the same orientation, for 
instance head to tail or vice versa, the cell cannot distinguish individual units. 
Consequently, the reductive process can occur throughout the sequences. In contrast, 

20 when for example, the units are presented head to head, rather than head to tail, the 
inversion delineates the endpoints of the adjacent unit so that deletion formation will 
favor the loss of discrete units. Thus, it is preferable with the present method that the 
sequences are in the same orientation. Random orientation of quasi-repeated 
sequences will result in the loss of reassortment efficiency, while consistent 

25 orientation of the sequences will offer the highest efficiency. However, while having 
fewer of the contiguous sequences in the same orientation decreases the efficiency, it 
may still provide sufficient elasticity for the effective recovery of novel molecules. 
Constructs can be made with the quasi-repeated sequences in the same orientation to 
allow higher efficiency. 

30 Sequences can be assembled in a head to tail orientation using any of a variety 

of methods, including the following: 

a) Primers that include a poly-A head and poly-T tail which when made 
single-stranded would provide orientation can be utilized. This is accomplished by 



wo 02/068589 



PCT/US02/05068 



40 

5 having the first few bases of the primers made from RNA and hence easily removed 
RNAseH. 

b) Primers that include unique restriction cleavage sites can be utilized. 
Multiple sites, a battery of unique sequences, and repeated synthesis and ligation steps 
would be required. 

10 c) The inner few bases of the primer could be thiolated and an 

exonuclease used to produce properly tailed molecules. 

The recovery of the re-assorted sequences relies on the identification of 
cloning vectors with a reduced repetitive index (RI). The re-assorted encoding 
sequences can then be recovered by amplification. The products are re-cloned and 
15 expressed. The recovery of cloning vectors with reduced RI can be affected by: 

1) The use of vectors only stably maintained when the construct is 
reduced in complexity. 

2) The physical recovery of shortened vectors by physical procedures. In 
this case, the cloning vector would be recovered using standard plasmid isolation 

20 procedures and size fi'actionated on either an agarose gel, or column with a low 
molecular weight cut off utilizing standard procedures. 

3) The recovery of vectors containing interrupted genes which can be 
selected when insert size decreases. 

4) The use of direct selection techniques with an expression vector and 
25 the appropriate selection. 

Encoding sequences (for example, genes) from related organisms may 
demonstrate a high degree of homology and encode quite diverse protein products. 
These types of sequences are particularly useful in the present invention as quasi- 
repeats. However, while the examples illustrated below demonstrate the reassortment 
30 of nearly identical original encoding sequences (quasi-repeats), this process is not 
limited to such nearly identical repeats. 
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5 The following example demonstrates a method of the invention. Encoding 

nucleic acid sequences (quasi-repeats) derived from three (3) unique species are 
described. Each sequence encodes a protein with a distinct set of properties. Each of 
the sequences differs by a single or a few base pairs at a unique position in the 
sequence. The quasi-repeated sequences are separately or collectively amplified and 
10 ligated into random assemblies such that all possible permutations and combinations 
are available in the population of ligated molecules. The number of quasi-repeat units 
can be controlled by the assembly conditions. The average number of quasi-repeated 
units in a construct is defined as the repetitive index (RI). 

Once formed, the constructs may, or may not be size fractionated on an 
1 5 agarose gel according to published protocols, inserted into a cloning vector, and 

transfected into an appropriate host cell. The cells are then propagated and "reductive 
reassortment" is effected. The rate of the reductive reassortment process may be 
stimulated by the introduction of DNA damage if desired. Whether the reduction in 
RI is mediated by deletion formation between repeated sequences by an "intra- 
20 molecular"' mechanism, or mediated by recombination-like events through "inter- 
molecular" mechanisms is immaterial. The end result is a reassortment of the 
molecules into all possible combinations. 

Optionally, the method comprises the additional step of screening the library 
members of the shuffled pool to identify individual shuffled library members having 
25 the ability to bind or otherwise interact, or catalyze a particular reaction (e.g., such as 
catalytic domain of an enzyme) with a predetermined macromolecule, such as for 
example a proteinaceous recq^tor, an oligosaccharide, viron, or other predetermined 
compound or structure. 

The polypeptides that are identified from such libraries can be used for 
30 thenq)eutic, diagnostic, research and related purposes (e.g., catalysts, solutes for 

increasing osmolarity of an aqueous solution, and the like), and/or can be subjected to 
one or more additional cycles of shuffling and/or selection. 



wo 02/068589 



PCT/US02/05068 



42 

5 In another aspect, it is envisioned that prior to or during recombination or 

reassortment, polynucleotides generated by the method of the invention can be 
subjected to agents or processes which promote the introduction of mutations into the 
original polynucleotides. The introduction of such mutations would increase the 
diversity of resulting hybrid polynucleotides and polypeptides encoded therefrom. 

1 0 The agents or processes which promote mutagenesis can include, but are not limited 
to: (+)-CC-1065, or a synthetic analog such as (+)-CC-1065-(N3. Adenine (See Sun 
and Hurley, (1992); an N-acclylated or deacetylated 4'-fluro-4-aminobiphenyl adduct 
capable of inhibiting DNA synthesis (See , for example, van de Poll et al. (1992)); or 
a N-acetylated or deacetylated 4-aminobiphenyl adduct capable of inhibiting DNA 

15 synthesis (See also, van de Poll et al. (1992), pp. 751-758); trivalent chromium, a 
trivalent chromium salt, a polycyclic aromatic hydrocarbon (PAH) DNA adduct 
capable of inhibiting DNA replication, such as 7-bromomethyl.benz[a]anthracene 
("BMA"), tris(2,3^ibromopropyl)phosphate CTris-BP"), l,2-dibromo-3- 
chloropropanc ("DBCF'), 2-bromoacrolein (2BA), benzo[a]pyrene-7,8Klihydrodiol- 

20 9-lO^oxide ("BPDE*'). a platinum(U) halogen salt, N-hydroxy.2-amino-3- 
methylimidazo[4,5-f]-quinoline ("N-hydroxy-IQ'*), and N-hydroxy-2-amino.l- 
methyl-6-phenylimidazo[4,5-f]-pyridine CTST-hydroxy-PhlP"). Especially preferred 
means for slowing or halting PCR amplification consist of UV light (+)-CC-1065 and 
(+)-CC-1065-(N3-Adenine). Particularly encompassed means are DNA adducts or 

25 polynucleotides comprising the DNA adducts from the polynucleotides or 

polynucleotides pool, which can be released or removed by a process including 
heating the solution comprising the polynucleotides prior to further processing. 

In another aspect the invention is directed to a method of producing 
recombinant proteins having biological activity by treating a sample comprising 
30 double-stranded template polynucleotides encoding a wild-type protein under 

conditions according to the invention which provide for the production of hybrid or 
re-assorted polynucleotides. 

The invention also provides for the use of proprietary codon primers 
(containing a degenerate N,N,N sequence) to introduce point mutations into a 
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5 polynucleotide, so as to generate a set of progeny polypeptides in which a full range 
of single amino acid substitutions is represented at each amino acid position (gene site 
saturated mutagenesis (GSSM)). The oligos used are comprised contiguously of a 
first homologous sequence, a degenerate N,N,N sequence, and preferably but not 
necessarily a second homologous sequence. The downstream progeny translational 
10 products firom the use of such oligos include all possible amino acid changes at each 
amino acid site along the polypeptide, because the degeneracy of the N,N,N sequence 
includes codons for all 20 amino acids. 

In one aspect, one such degenerate oligo (comprised of one degenerate N,N,N 
cassette) is used for subjecting each original codon in a parental polynucleotide 

15 template to a full range of codon substitutions. In another aspect, at least two 

degenerate N,N,N cassettes are used - either in the same oligo or not, for subjecting at 
least two origmal codons in a parental polynucleotide template to a full range of 
codon substitutions. Thus, more than one N,N,N sequence can be contained in one 
oligo to introduce amino acid mutations at more than one site. This plurality of 

20 N,N,N sequences can be directly contiguous, or separated by one or more additional 
nucleotide sequence(s). In another aspect, oligos serviceable for introducing additions 
and deletions can be used either alone or in combination with the codons containing 
an N,N,N sequence, to introduce any combination or permutation of amino acid 
additions, deletions, and/or substitutions. 

25 In a particular exemplification, it is possible to simultaneously mutagenize two 

or more contiguous amino acid positions using an oligo that contains contiguous 
N,N,N triplets, i.e. a degenerate (N,N,N)n sequence. 

hi another aspect, the present invention provides for the use of degenerate 
cassettes having less degeneracy than the N,N,N sequence. For example, it may be 
30 desirable in some instances to use (e.g. in an oligo) a degenerate triplet sequence 
comprised of only one N, where said N can be in the first second or third position of 
the triplet. Any other bases including any combinations and permutations thereof can 
be used in the remaining two positions of the triplet. Alternatively, it may be 
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5 desirable in some instances to use (e.g., in an oligo) a degenerate N,NJ4 triplet 
sequence, N,N.G/T, or an N.N, G/C triplet sequence. 

It is appreciated, however, that the use of a degenerate triplet (such as 
N,N,G/T or an N,N, G/C triplet sequence) as disclosed in the instant invention is 
advantageous for several reasons. In one aspect, this invention provides a means to 
10 systematically and fairly easily generate the substitution of the full range of possible 
amino acids (for a total of 20 amino acids) into each and every amino acid position in 
a polypeptide. Thus, for a 100 amino acid polypeptide, the invention provides a way 
to systematically and fairly easily generate 2000 distinct species (i.e.. 20 possible 
amino acids per position times 1 00 amino acid positions). It is appreciated that there 
1 5 is provided, through the use of an oligo containing a degenerate N,N,G/T or an N,N, 
G/C triplet sequence, 32 individual sequences that code for 20 possible amino acids. 
Thus, in a reaction vessel in which a parental polynucleotide sequence is subjected to 
saturation mutagenesis using one such oligo, there are generated 32 distinct progeny 
polynucleotides encoding 20 distinct polypeptides. In contrast, the use of a non- 
20 degenerate oligo in site-directed mutagenesis leads to only one progeny polypeptide 
product per reaction vessel. 

This invention also provides for the use of nondegenerate oligos, which can 
optionally be used in combination with degenerate primers disclosed. It is appreciated 
that in some situations, it is advantageous to use nondegenerate oligos to generate 
25 specific point mutations in a working polynucleotide. This provides a means to 
generate specific silent point mutations, point mutations leading to corresponding 
amino acid changes, and point mutations that cause the generation of stop codons and 
the corresponding expression of polypeptide fragments. 

Thus, in a preferred embodiment of this invention, each saturation 
30 mutagenesis reaction vessel contains polynucleotides encoding at least 20 progeny 
polypeptide molecules such that all 20 amino acids are represented at the one specific 
amino acid position corresponding to the codon position mutagenized in the parental 
polynucleotide. The 32-fold degenerate progeny polypeptides generated from each 
saturation mutagenesis reaction vessel can be subjected to clonal amplification (e.g.. 
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5 cloned into a suitable E. coli host using an expression vector) and subjected to 
expression screening. When an individual progeny polypeptide is identified by 
screening to display a favorable change in property (when compared to the parental 
polypeptide), it can be sequenced to identify the correspondingly favorable amino 
acid substitution contained therein. 

^0 It is appreciated that upon mutagenizing each and every amino acid position in 

a parental polypeptide using saturation mutagenesis as disclosed herein, favorable 
amino acid changes may be identified at more than one amino acid position. One or 
more new progeny molecules can be generated that contain a combination of all or 
part of these favorable amino acid substitutions. For example, if 2 specific favorable 

1 5 amino acid changes are identified in each of 3 amino acid positions in a polypeptide, 
the permutations include 3 possibilities at each position (no change fi-om the original 
amino acid, and each of two favorable changes) and 3 positions. Thus, there are 3 x 3 
X 3 or 27 total possibilities, including 7 that were previously examined - 6 single point 
mutations (i.e., 2 at each of three positions) and no change at any position. 

20 In yet another aspect, site-saturation mutagenesis can be used together with 

shuffling, chimerization, recombination and other mutagenizing processes, along with 
screening. This invention provides for the use of any mutagenizing process(es), 
including saturation mutagenesis, in an iterative manner. In one exemplification, the 
iterative use of any mutagenizing process(es) is used in combination with screening. 

25 Thus, in a non-limiting exemplification, this invention provides for the use of 

saturation mutagenesis in combination with additional mutagenization processes, such 
as process where two or more related polynucleotides are introduced into a suitable 
host cell such that a hybrid polynucleotide is generated by recombination and 
reductive reassortment. 

30 In addition to performing mutagenesis along the entire sequence of a gene, the 

instant invention provides that mutagenesis can be use to replace each of any number 
of bases in a polynucleotide sequence, wherein the number of bases to be 
mutagenized is preferably every integer fi-om 15 to 100,000. Thus, instead of 
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5 mutagenizing every position along a molecule, one can subject every or a discrete 
number of bases (preferably a subset totaling from 15 to 100,000) to mutagenesis. 
Preferably, a separate nucleotide is used for mutagenizing each position or group of 
positions along a polynucleotide sequence, A group of 3 positions to be mutagenized 
may be a codon. The mutations are preferably introduced using a mutagenic primer, 
10 containing a heterologous cassette, also referred to as a mutagenic cassette. Preferred 
cassettes can have from 1 to 500 bases. Each nucleotide position in such heterologous 
cassettes be N, A, C. G, T, A/C, A/G, ATT, OG, CAT, G/T, C/G/T, A/G/T, A/C/T, 
A/C/G, or E, where E is any base that is not A, C, G, or T (E can be referred to as a 
designer oligo). 

15 In a general sense, saturation mutagenesis is comprised of mutagenizing a 

complete set of mutagenic cassettes (wherein each cassette is preferably about 1-500 
bases in length) in defined polynucleotide sequence to be mutagenized (wherein the 
sequence to be mutagenized is preferably from about 15 to 100,000 bases in length). 
Thus, a group of mutations (ranging from 1 to 100 mutations) is introduced into each 

20 cassette to be mutagenized. A grouping of mutations to be introduced into one 
cassette can be different or the same from a second grouping of mutations to be 
introduced into a second cassette during the application of one round of saturation 
mutagenesis. Such groupings are exemplified by deletions, additions, groupings of 
particular codons, and groupings of particular nucjeotide cassettes. 

25 Defined sequences to be mutagenized include a whole gene, pathway, cDNA, 

an entire open reading frame (ORF), and entire promoter, enhancer, 
repressor/transactivator, origin of replication, intron, operator, or any polynucleotide 
functional group. Generally, a "defined sequences" for this purpose may be any 
polynucleotide that a 15 base-pplynucleotide sequence, and polynucleotide sequences 

30 of lengths between 1 5 bases and 1 5,000 bases (this invention specifically names every 
integer in between). Considerations in choosing groupings of codons include types of 
amino acids encoded by a degenerate mutagenic cassette. 

In a particularly preferred exemplification a grouping of mutations that can be 
introduced into a mutagenic cassette, this invention specifically provides for 
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5 degenerate codon substitutions (using degenerate oligos) that code for 2, 3, 4, 5, 6, 7, 
8, 9, 10. 1 1, 12, 13, 14, 15, 16. 17, 18, 19, and 20 amino acids at each position, and a 
library of polypeptides encoded thereby. 

One aspect of the invention is an isolated nucleic acid comprising one of the 
sequences of Group A nucleic acid sequences, and sequences substantially identical 

1 0 thereto, the sequences complementary thereto, or a fragment comprising at least 1 0, 
1 5, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive bases of 
one of the sequences of a Group A nucleic acid sequence (or the sequences 
complementary thereto). The isolated, nucleic acids may comprise DNA, including 
cDNA, genomic DNA, and synthetic DNA. The DNA may be double-stranded or 

1 5 single-stranded, and if single stranded may be the coding strand or non-coding (anti- 
sense) strand. Alternatively, the isolated nucleic acids may comprise RNA. 

As discussed in more detail below, the isolated nucleic acids of one of the 
Group A nucleic acid sequences, and sequences substantially identical thereto, may be 
used to prepare one of the polypeptides of a Group B amino acid sequence, and 
20 sequences substantially identical thereto, or fragments comprising at least 5, 10, 15, 
20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of one of the 
polypeptides of Group B amino acid sequences, and sequences substantially identical 
thereto. 

Accordingly, another aspect of the mvention is an isolated nucleic acid which 
25 encodes one of the polypeptides of Group B amino acid sequences, and sequences 
substantially identical thereto, or fragments comprising at least 5, 10, 15, 20, 25, 30, 
35, 40, 50, 75, 100, or 150 consecutive amino acids of one of the polypeptides of the 
Group B amino acid sequences. The coding sequences of these nucleic acids may be 
identical to one of the coding sequences of one of the nucleic acids of Group A 
30 nucleic acid sequences, or a fragment thereof or may be different coding sequences 
which encode one of the polypeptides of Group B amino acid sequences, sequences 
substantially identical thereto, and fragments having at least 5, 10, 15, 20, 25, 30, 35, 
40, 50, 75, 100, or 150 consecutive amino acids of one of the polypeptides of Group B 
amino acid sequences, as a resuh of the redundancy or degeneracy of the genetic 
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5 code. The genetic code is well known to those of skill in the art and can be obtained, 
for example, on page 214 of B. Lewin, Genes VI, Oxford University Press, 1997, the 
disclosure of which is incorporated herein by reference. 

The isolated nucleic acid which encodes one of the polypeptides of Group B 
amino acid sequences, and sequences substantially identical thereto, may include, but 

10 is not limited to: only the coding sequence of one of Group A nucleic acid sequences, 
and sequences substantially identical thereto, and additional coding sequences, such 
as leader sequences orproprotein sequences and non-coding sequences, such as 
introns or non-coding sequences 5* and/or 3' of the coding sequence. Thus, as used 
herein, the term "polynucleotide encoding a polypeptide" encompasses a 

1 5 polynucleotide which includes only the coding sequence for the polypeptide as well as 
a polynucleotide which includes additional coding and/or non-coding sequence. 

Alternatively, the nucleic acid sequences of Group A nucleic acid sequences, 
and sequences substantially identical thereto, may be mutagenized using conventional 
techniques, such as site directed mutagenesis, or other techniques familiar to those 

20 skilled in the art, to introduce silent changes into the polynucleotides of Group A 

nucleic acid sequences, and sequences substantially identical thereto. As used herein, 
"silent changes" include, for example, changes which do not alter the amino acid 
sequence encoded by the polynucleotide. Such changes may be desirable in order to 
increase the level of the polypeptide produced by host cells containing a vector 

25 encoding the polypeptide by introducing codons or codon pairs which occur 
frequently in the host organism. 

The invention also relates to polynucleotides which have nucleotide changes 
which result in amino acid substitutions, additions, deletions, fusions and truncations 
in the polypeptides of Group B amino acid sequences, and sequences substantially 
30 identical thereto. Such nucleotide changes may be introduced using techniques such 
as site directed mutagenesis, random chemical mutagenesis, exonuclease HI deletion, 
and other recombinant DNA techniques. Alternatively, such nucleotide changes may 
be naturally occurring allelic variants which are isolated by identifying nucleic acids 
which specifically hybridize to probes comprising at least 10, 15, 20, 25, 30, 35, 40, 
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5 50. 75, 1 00. 1 50, 200, 300, 400, or 500 consecutive bases of one of the sequences of 
Group A nucleic acid sequences, and sequences substantially identical thereto (or the 
sequences complementary thereto) under conditions of high, moderate, or low 
stringency as provided herein. 

The isolated nucleic acids of Group A nucleic acid sequences, and sequences 
1 0 substantially identical thereto, the sequences complementary thereto, or a fragment 
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive bases of one of the sequences of Group A nucleic acid sequences, and 
sequences substantially identical thereto, or the sequences complementary thereto 
may also be used as probes to determine whether a biological sample, such as a soil 
15 sample, contains an organism having a nucleic acid sequence of the invention or an 
organism from which the nucleic acid was obtained. In such procedures, a biological 
sample potentially harboring the organism from which the nucleic acid was isolated is 
obtained and nucleic acids are obtained from the sample. The nucleic acids are 
contacted with the probe under conditions which permit the probe to specifically 
20 hybridize to any complementary sequences &om which are present therein. 

Where necessary, conditions which permit the probe to specifically hybridize 
to complementary sequences may be determined by placing the probe in contact with 
complementary sequences fiom samples known to contain the complementary 
sequence as well as control sequences which do not contain the complementary 
25 sequence. Hybridization conditions, such as the salt concentration of the 

hybridization buffer, the formamide concentration of the hybridization buffer, or the 
hybridization temperature, may be varied to identify conditions which allow the probe 
to hybridize specifically to complementary nucleic acids. 

If the sample contains the organism from which the nucleic acid was isolated, 
30 specific hybridization of the probe is then detected. Hybridization may be detected by 
labeling the probe with a detectable agent such as a radioactive isotope, a fluorescent 
dye or an enzyme capable of catalyzing the formation of a detectable product. 
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5 Many methods for using the labeled probes to detect the presence of 

complementary nucleic acids in a sample are familiar to those skilled in the art. These 
include Southern Blots, Northern Blots, colony hybridization procedures, and dot 
blots. Protocols for each of these procedures are provided in Ausubel et al Current 
Protocols in Molecular Biology, John Wiley 503 Sons, Inc. (1997) and Sambrook et 
10 al.. Molecular Cloning: A Laboratory Manual 2nd Ed., Cold Spring Harbor 

Laboratory Press (1989), the entire disclosures of which are incorporated herein by 
reference. 

Alternatively, more than one probe (at least one of which is capable of 
specifically hybridizing to any complementary sequences which are present in the 

15 nucleic acid sample), may be used in an amplification reaction to determine whether 
the sample contains an organism containing a nucleic acid sequence of the invention 
(e.g., an organism from which the nucleic acid was isolated). Typically, the probes 
comprise oligonucleotides. In one embodiment, the amplification reaction may 
comprise a PGR reaction. PCR protocols are described in Ausubel and Sambrook, 

20 supra. Alternatively, the amplification may comprise a ligase chain reaction, 3SR, or 
strand displacement reaction. (See Barany, F,, *The Ligase Chain Reaction in a PCR 
World", PCR Methods and Applications 1:5-16, 1991; E. Fahy et al., "Self-sustained 
Sequence Replication (3SR): An Isothermal Transcription-based Amplification 
System Alternative to PCR", PCR Methods and Applications 1:25-33, 1991; and 

25 Walker G.T. et al, "Strand Displacement Amplification-an Isothermal in vitro DNA 
Amplification Technique", Nucleic Acid Research 20:1691-1696, 1992, the 
disclosures of which are incorporated herein by reference in their entireties). In such 
procedures, the nucleic acids in the sample are contacted with the probes, the 
amplification reaction is performed, and any resulting amplification product is 

30 detected. The amplification product may be detected by performing gel 

electrophoresis on the reaction products and staining the gel with an interculator such 
as ethidium bromide. Alternatively, one or more of the probes may be labeled with a 
radioactive isotope and the presence of a radioactive amplification product may be 
detected by autoradiography after gel electrophoresis. 
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5 Probes derived from sequences near the ends of the sequences of Group A 

nucleic acid sequences, and sequences substantially identical thereto, may also be 
used in chromosome walking procedures to identify clones containing genomic 
sequences located adjacent to the sequences of Group A nucleic acid sequences, and 
sequences substantially identical thereto. Such methods allow the isolation of genes 
10 which encode additional proteins from the host organism. 

The isolated nucleic acids of Group A nucleic acid sequences, and sequences 
substantially identical thereto, the sequences complementary thereto, or a fragment 
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive bases of one of the sequences of Group A nucleic acid sequences, and 

1 5 sequences substantially identical thereto, or the sequences complementary thereto 
may be used as probes to identify and isolate related nucleic acids. In some 
embodiments, the related nucleic acids may be cDNAs or genomic DNAs from 
organisms other than the one from which the nucleic acid was isolated. For example, 
the other organisms may be related organisms. In such procedures, a nucleic acid 

20 sample is contacted with the probe under conditions which permit the probe to 
specifically hybridize to related sequences. Hybridization of the probe to nucleic 
acids from the related organism is then detected using any of the methods described 
above. 

In nucleic acid hybridization reactions, the conditions used to achieve a 
25 particular level of stringency will vary, depending on the nature of the nucleic acids 
being hybridized. For example, the length, degree of complementarity, nucleotide 
sequence composition (e.g., GC v. AT content), and nucleic acid type (e.g., RNA v. 
DNA) of the hybridizing regions of the nucleic acids can be considered in selecting 
hybridization conditions. An additional consideration is whether one of the nucleic 
30 acids is immobilized, for example, on a filter. 

Hybridization may be carried out under conditions of low stringency, 
moderate stringency or high stringency. As an example of nucleic acid hybridization, 
a polymer membrane containing immobilized denatured nucleic acids is first 
prehybridized for 30 minutes at 45*C in a solution consisting of 0.9 M NaCl, 50 mM 
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5 NaH2P04. pH 7.0, 5.0 mM Na2EDTA, 0.5% SDS, lOX Denhardf s, and 0,5 mg/ml 
polyriboadenylic acid. Approximately 2 X 107 cpm (specific activity 4-9 X 108 
cpm/ug) of ^^P end-labeled oligonucleotide probe are then added to the solution. 
After 12-16 hours of incubation, the membrane is washed for 30 minutes at room 
temperature in IX SET (150 mM NaCl, 20 mM Tris hydrochloride, pH 7.8, 1 mM 
10 Na2EDTA) containing 0.5% SDS, followed by a 30 minute wash in fresh IX SET at 
Tm-10"C for the oligonucleotide probe. The membrane is then exposed to auto- 
radiographic film for detection of hybridization signals. 

By varying the stringency of the hybridization conditions used to identify 
nucleic acids, such as cDNAs or genomic DNAs, which hybridize to the detectable 

1 5 probe, nucleic acids having different levels of homology to the probe can be identified 
and isolated. Stringency may be varied by conducting the hybridization at varying 
temperatures below the melting temperatures of the probes. The melting temperature, 
Tm, is the temperature (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly complementary probe. Very stringent 

20 conditions are selected to be equal to or about 5**C lower than the Tm for a particular 
probe. The melting temperature of the probe may be calculated using the following 
formulas: 

For probes between 14 and 70 nucleotides in length the melting temperature 
(Tm) is calculated using the formula: Tm=81.5+16.6(log [Na+])+0.41 (fraction G+C)- 
25 (600/N) where N is the length of the probe. 

If the hybridization is carried out in a solution containing formamide, the 
melting temperature may be calculated using the equation: Tm=81 .5+16.6(log 
[Na+])+0.41(fraction G+C)-(0.63% formamide).(600/N) where N is the length of the 
probe. 

30 Prehybridization may be carried out in 6X SSC. 5X Denhardt's reagent, 0.5% 

SDS, lOOjig denatured Augmented salmon sperm DNA or 6X SSC, 5X Denhardfs 
reagent, 0.5% SDS, lOOjxg denatured Augmented sahnon sperm DNA, 50% 
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5 formamide. The formulas for SSC and Denhardt's solutions are listed in Sambrook et 
al, supra. 

Hybridization is conducted by adding the detectable probe to the 
prehybridization solutions listed above. Where the probe comprises double stranded 
DNA, it is denatured before addition to the hybridization solution. The filter is 

10 contacted with the hybridization solution for a sufficient period of time to allow the 
probe to hybridize to cDNAs or genomic DNAs containing sequences complementary 
thereto or homologous thereto. For probes over 200 nucleotides in length, the 
hybridization may be carried out at 1 5-25''C below the Tm. For shorter probes, such 
as oligonucleotide probes, the hybridization may be conducted at 5-10*C below the 

15 Tm. Typically, for hybriidizations in 6X SSC, the hybridization is conducted at 
approximately 68°C. Usually, for hybridizations in 50% fonmamide containing 
solutions, the hybridization is conducted at approximately 42'C. 

All of the foregoing hybridizations would be considered to be under 
conditions of high stringency. 

20 Following hybridization, the filter is washed to remove any non-specifically 

bound detectable probe. The stringency used to wash the filters can also be varied 
depending on the nature of the nucleic acids being hybridized, the length of the 
nucleic acids being hybridized, the degree of complementarity, the nucleotide 
sequence composition (e.g., GC v. AT content), and the nucleic acid type (e.g., RNA 

25 V. DNA). Examples ofprogressively higher stringency condition washes are as 
follows: 2X SSC, 0.1% SDS at room temperature for 15 minutes (low stringency); 
O.IX SSC, 0.5% SDS at room temperature for 30 minutes to 1 hour (moderate 
stringency); O.IX SSC, 0.5% SDS for 15 to 30 minutes at between the hybridization 
temperature and 68**C (high stringency); and 0.1 5M NaCl for 15 minutes at 72^C 

30 (very high stringency). A final low stringency wash can be conducted in 0. 1 X SSC at 
room temperature. The examples above are merely illustrative of one set of 
conditions that can be used to wash filters. One of skill in the art would know that 
there are numerous recipes for different stringency washes. Some other examples are 
given below. 
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5 Nucleic acids which have hybridized to the probe are identified by 

autoradiography or other conventional techniques. 

The above procedure may be modified to identify nucleic acids having 
decreasing levels of homology to the probe sequence. For example, to obtain nucleic 
acids of decreasing homology to the detectable probe, less stringent conditions may 

10 be used. For example, the hybridization temperature may be decreased in increments 
of 5**C from 68**C to ATC in a hybridization buffer having a Na+ concentration of 
approximately IM. Following hybridization, the filter may be washed with 2X SSC, 
0,5% SDS at the temperature of hybridization. These conditions are considered to be 
"moderate" conditions above 50**C and "low" conditions below 50^C. A specific 

1 5 example of "moderate" hybridization conditions is when the above hybridization is 
conducted at 55°C. A specific example of "low stringency" hybridization conditions 
is when the above hybridization is conducted at 45**C. 

Alternatively, the hybridization may be carried out in buffers, such as 6X SSC, 
containing formamide at a temperature of 42**C. In this case, the concentration of 

20 formamide in the hybridization buffer may be reduced in 5% increments from 50% to 
0% to identify clones having decreasing levels of homology to the probe. Following 
hybridization, the filter may be washed with 6X SSC, 0.5% SDS at 50^C. These 
conditions are considered to be "moderate" conditions above 25% formamide and 
"low" conditions below 25% formamide. A specific example of "moderate" 

25 hybridization conditions is when the above hybridization is conducted at 30% 

formamide. A specific example of "low stringency" hybridization conditions is when 
the above hybridization is conducted at 10% formamide. 

For example, the preceding methods may be used to isolate nucleic acids 
having a sequence with at least about 97%, at least 95%, at least 90%, at least 85%, at 
30 least 80%, at least 70%, at least 65%, at least 60%, at least 55%, or at least 50% 

homology to a nucleic acid sequence selected from the group consisting of one of the 
sequences of Group A nucleic acid sequences, and sequences substantially identical 
thereto, or fragments comprising at least about 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 
150, 200, 300, 400, or 500 consecutive bases thereof, and the sequences 
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5 complementary thereto. Homology may be measured using the aligranent algorithm. 
For example, the homologous polynucleotides may have a coding sequence which is a 
naturally occurring allelic variant of one of the coding sequences described herein. 
Such allelic variants may have a substitution, deletion or addition of one or more 
nucleotides when compared to the nucleic acids of Group A nucleic acid sequences or 
10 the sequences complementary thereto. 

Additionally, the above procedures may be used to isolate nucleic acids which 
encode polypeptides having at least about 99%, 95%, at least 90%, at least 85%, at 
least 80%, at least 75%, at least 70%, at least 65%, at least 60%, at least 55%, or at 
least 50% homology to a polypeptide having the sequence of one of Group B amino 
15 acid sequences, and sequences substantially identical thereto, or fragments comprising 
at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids 
thereof as determined using a sequence alignment algorithm (e.g., such as the FASTA 
version 3.0t78 algorithm with the default parameters). 

Another aspect of the invention is an isolated or purified polypeptide 
20 comprising the sequence of one of Group A nucleic acid sequences, and sequences 
substantially identical thereto, or jfragments comprising at least about 5, 10, 15, 20, 
25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof. As discussed 
above, such polypeptides may be obtained by inserting a nucleic acid encoding the 
polypeptide into a vector such that the coding sequence is operably linked to a 
sequence capable of driving the expression of the encoded polypeptide in a suitable 
host cell. For example, the expression vector may comprise a promoter, a ribosome 
binding site for translation initiation and a transcription temiinator. The vector may 
also include s^propriate sequences for amplifying expression. 

Promoters suitable for expressing the polypeptide or fragment thereof in 
bacteria include the E, coli lac or trp promoters, the /ac/ promoter, the lacZ promoter, 
the T3 promoter, the 77 promoter, the gpt promoter, the lambda P/? promoter, the 
lambda Pi promoter, promoters from operons encoding glycolytic enzymes such as 3- 
phosphoglycerate kinase (PGK), and the acid phosphatase promoter. Fungal 
promoters include the V factor promoter. Eukaryotic promoters include the CMV 
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5 immediate early promoter, the HSV thymidine kinase promoter, heat shock 

promoters, the early and late SV40 promoter, LTRs from retroviruses, and the mouse 
metallothionein-I promoter Other promoters known to control expression of genes in 
prokaryotic or eukaryotic cells or their viruses may also be used. 

Mammalian expression vectors may also comprise an origin of replication, 
10 any necessary ribosome binding sites, a polyadenylation site, splice donor and 

acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed 
sequences. In some embodiments, DNA sequences derived from the SV40 splice and 
polyadenylation sites may be used to provide the required nontranscribed genetic 
elements. 

1 5 Vectors for expressing the polypeptide or fragment thereof in eukaryotic cells 

may also contain enhancers to increase expression levels. Enhancers are cis-acting 
elements of DNA, usually from about 10 to about 300 bp in length that act on a 
promoter to increase its transcription. Examples include the SV40 enhancer on the 
late side of the replication origin bp 100 to 270, the cytomegalovirus early promoter 

20 enhancer, the polyoma enhancer on the late side of the replication origin, and the 
adenovirus enhancers. 

In addition, the expression vectors typically contain one or more selectable 
marker genes to permit selection of host cells containing the vector. Such selectable 
markers include genes encoding dihydrofolate reductase or genes conferring 
25 neomycin resistance for eukaryotic cell culture, genes conferring tetracycline or 
ampicillin resistance in E. coli, and the S. cerevisiae TRPl gene. 

In some embodiments, the nucleic acid encoding one of the polypeptides of 
Group B amino acid sequences, and sequences substantially identical thereto, or 
fragments comprising at least about 5, 10, 15, 20, 25. 30, 35, 40, 50, 75, 100, or 150 
30 consecutive amino acids thereof is assembled in appropriate phase with a leader 
sequence capable of directing secretion of the translated polypeptide or fragment 
thereof. Optionally, the nucleic acid can encode a fusion polypeptide in which one of 
the polypeptides of Group B amino acid sequences, and sequences substantially 
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5 identical thereto, or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 
100, or 150 consecutive amino acids thereof is fused to heterologous peptides or 
polypeptides, such as N-terminal identification peptides which impart desired 
characteristics, such as increased stability or simplified purification. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
10 procedures. In general, the DNA sequence is ligated to the desired position in the 
vector following digestion of the insert and the vector with appropriate restriction 
endonucleases. Alternatively, blunt ends in both the insert and the vector may be 
ligated. A variety of cloning techniques are disclosed in Ausubel et al Current 
Protocols in Molecular Biology, John Wiley 503 Sons, Inc. 1997 and Sambrook et al., 
1 5 Molecular Cloning: A Laboratory Manual 2nd Ed., Cold Spring Harbor Laboratory 
Press (1989), the entire disclosures of which are incorporated herein by reference. 
Such procedures and others are deemed to be within the scope of those skilled in the 
art. 

The vector may be, for example, in the form of a plasmid, a viral particle, or a 
20 phage. Other vectors include chromosomal, nonchromosomal and synthetic DNA 
sequences, derivatives of SV40; bacterial plasmids, phage DNA, baculovirus, yeast 
plasmids, vectors derived from combinations of plasmids and phage DNA, viral DNA 
such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. A variety of cloning 
and expression vectors for use with prokaryotic and eukaryotic hosts are described by 
25 Sambrook, et al.. Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring 
Harbor, N.Y,, (1989), the disclosure of which is hereby incorporated by reference. 

Particular bacterial vectors which may be used include the commercially 
available plasmids comprising genetic elements of the well known cloning vector 
pBR322 (ATCC 37017), pKK223-3 (Phamacia Fine Chemicals, Uppsala, Sweden), 
30 GEMl (Promega Biotec, Madison, WI, USA) pQE70, pQE60, pQE-9 (Qiagen), 
pDlO, psiX174 pBluescript II KS, pNHSA, pNH16a, pNHlSA, pNH46A 
(Stratagene), ptrc99a, pKK223-3, pKK233.3, pDR540, pRIT5 (Pharmacia), pKK232- 
8 and pCM7. Particular eukaryotic vectors include pSV2CAT, pOG44, pXTl, pSG 
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5 (Stratagene) pSVK3, pBPV, pMSG, and pSVL (Pharmacia). However, any other 
vector may be used as long as it is rcplicable and viable in the host cell. 

The host cell may be any of the host cells familiar to those skilled in the art, 
including prokaryotic cells, eukaryotic cells, mammalian cells, insect cells, or plant 
cells. As representative examples of appropriate hosts, there may be mentioned: 

10 bacterial cells, such as E, coli, Streptomyces, Bacillus subtilis. Salmonella 

typhimurium and various species within the genera Pseudomonas, Streptomyces, and 
Staphylococcus, fungal cells, such as yeast, insect cells such as Drosophila S2 and 
Spodoptera Sf9, animal cells such as CHO, COS or Bowes melanoma, and 
adenoviruses. The selection of an appropriate host is within the abilities of those 

IS skilled in the art. 

The vector may be introduced into the host cells using any of a variety of 
techniques, including transformation, transfection, transduction, viral infection, gene 
guns, or Ti-mediated gene transfer. Particular methods include calcium phosphate 
transfection, DEAE-Dextran mediated transfection, lipofection, or electroporation 
20 (Davis, L., Dibner, M., Battey, L, Basic Methods in Molecular Biology, (1986)). 

Where appropriate, the engineered host cells can be cultured in conventional 
nutrient media modified as q>propriate for activating promoters, selecting 
transformants or amplifying the genes of the invention. Following transformation of a 
suitable host strain and growth of the host strain to an appropriate cell density, the 
25 selected promoter may be induced by appropriate means (e.g., temperature shift or 
chemical induction) and the cells may be cultured for an additional period to allow 
them to produce the desired polypeptide or fragment thereof. 

Cells are typically harvested by centrifugation. dismpted by physical or 
chemical means, and the resulting crude extract is retained for further purification. 
30 Microbial cells employed for expression of proteins can be disrupted by any 

convenient method, including freeze-thaw cycling, sonication, mechanical disruption, 
or use of cell lysing agents. Such methods are well known to those skilled in the art. 
The expressed polypeptide or fragment thereof can be recovered and purified from 
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5 recombinant cell cultures by methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. 
Protein refolding steps can be used, as necessary, in completing configuration of the 
10 polypeptide. If desired, high performance liquid chromatography (HPLC) can be 
employed for final purification steps. 

Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts (described by Gluzman, Cell, 23:175, 1981), and 
1 5 other cell lines capable of expressing proteins fi-om a compatible vector, such as the 
C127, 3T3, CHO, HeLa and BHK cell lines. 

The constructs in host cells can be used in a conventional manner to produce 
the gene product encoded by the recombinant sequence. Depending upon the host 
employed in a recombinant production procedure, the polypeptides produced by host 
20 cells containing the vector may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may or may not also include an initial methionine 
amino acid residue. 

Alternatively, the polypeptides of Group B amino acid sequences, and 
sequences substantially identical thereto, or fiagments comprising at least 5, 10, 15, 
25 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof can be 

synthetically produced by conventional peptide synthesizers. In other embodiments, 
fiagments or portions of the polypeptides may be employed for producing the 
corresponding fiilMength polypeptide by peptide synthesis; therefore, the fragments 
may be employed as intermediates for producing the full-length polypeptides. 

30 Cell-free translation systems can also be employed to produce one of the 

polypeptides of Group B amino acid sequences, and sequences substantially identical 
thereto, or Augments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 
1 50 consecutive amino acids thereof using mRNAs transcribed fi^m a DNA 
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5 cx)nstruct comprising a promoter operably linked to a nucleic acid encoding the 
polypeptide or fragment thereof. In some embodiments, the DNA construct may be 
linearized prior to conducting an in vitro transcription reaction. The transcribed 
mRNA is then incubated with an appropriate cell-free translation extract, such as a 
rabbit reticulocyte extract, to produce the desired polypeptide or fragment thereof. 

10 The invention also relates to variants of the polypeptides of Group B amino 

acid sequences, and sequences substantially identical thereto, or fragments comprising 
at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids 
thereof The term "variant" includes derivatives or analogs of these polypeptides. In 
particular, the variants may differ in amino acid sequence from the polypeptides of 

15 Group B amino acid sequences, and sequences substantially identical thereto, by one 
or more substitutions, additions, deletions, fiisions and truncations, which may be 
present in any combination. 

The variants may be naturally occurring or created in vitro. In particular, such 
variants may be created using genetic engineering techniques such as site directed 
20 mutagenesis, random chemical mutagenesis, Exonuclease III deletion procedures, and 
standard cloning techniques. Alternatively, such variants, fragments, analogs, or 
derivatives may be created using chemical synthesis or modification procedures. 

Other methods of making variants are also familiar to those skilled in the art. 
These include procedures in which nucleic acid sequences obtained from natural 

25 isolates are modified to generate nucleic acids which encode polypeptides having 
characteristics which enhance their value in industrial or laboratory applications. In 
such procedures, a large number of variant sequences having one or more nucleotide 
differences with respect to the sequence obtained from the natural isolate are 
generated and characterized. Typically, these nucleotide differences result in amino 

30 acid changes with respect to the polypeptides encoded by the nucleic acids from the 
natural isolates. 

For example, variants may be created using error prone PGR. In error prone 
PGR, PGR is performed under conditions where the copying fidelity of the DNA 
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5 polymerase is low, such that a high rate of point mutations is obtained along the entire 
length of the PCR product. Error prone PCR is described in Leung, D 'W., et al.. 
Technique, 1:11-15, 1989) and Caldwell, R. C. & Joyce G.F., PCR Methods Applic, 
2:28-33, 1992, the disclosure of which is incoiporated herein by reference in its 
entirety. Briefly, in such procedures, nucleic acids to be mutagenized are mixed with 

10 PCR primers, reaction buffer. MgC12. MnC12, Taq polymerase and an appropriate 
concentration of dNTPs for achieving a high rate of point mutation along the entire 
length of the PCR product. For example, the reaction may be perfomed using 20 
finoles of nucleic acid to be mutagenized, SOpmole of each PCR primer, a reaction 
buffer comprising 50mM KCl, lOmM Tris HCl (pH 8.3) and 0.01% gelatin. 7mM 

15 MgCI2, 0.5niM MnC12, 5 units of Taq polymerase, 0.2mM dGTP, 0.2mM dATP. 
ImM dCTP. and ImM dTTP. PCR may be perfomied for 30 cycles of 94° C for 1 
min, 45^ C for 1 min, and IT" C for 1 min. However, it will be ^preciated that these 
parameters may be varied as appropriate. The mutagenized nucleic acids are cloned 
into an appropriate vector and the activities of the polypq)tides encoded by the 

20 mutagenized nucleic acids is evaluated. 

Variants may also be created using oligonucleotide directed mutagenesis to 
generate site-specific mutations in any cloned DNA of interest. Oligonucleotide 
mutagenesis is described in Reidhaar-Olson, J.F. & Sauer, R.T., et al., Science, 
241:53-57, 1988, the disclosure of which is incoiporated herein by reference in its 
25 entirety. Briefly, in such procedures a plurality of double stranded oligonucleotides 
bearing one or more mutations to be introduced into the cloned DNA are synthesized 
and inserted into the cloned DNA to be mutagenized. Clones containing the 
mutagenized DNA are recovered and the activities of the polypeptides they encode 
are assessed. 

30 Another method for generating variants is assembly PCR. Assembly PCR 

involves the assembly of a PCR product fix)m a mixture of small DNA fragments. A 
large number of different PCR reactions occur in parallel in the same vial, with the 
products of one reaction priming the products of another reaction. Assembly PCR is 
described in U.S. Patent No. 5,965,408, filed July 9, 1996, entitled, **Method of DNA 
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5 Reassembly by Interrupting Synthesis", the disclosure of which is incorporated herein 
by reference in its entirety. 

Still another method of generating variants is sexual PCR mutagenesis. In 
sexual PCR mutagenesis, forced homologous recombination occurs between DNA 
molecules of different but highly related DNA sequence in vitro, as a result of random 
10 fragmentation of the DNA molecule based on sequence homology, followed by 
fixation of the crossover by primer extension in a PCR reaction. Sexual PCR 
mutagenesis is described in Stemmer, W.P., PNAS, USA, 91:10747-10751, 1994, the 
disclosure of which is incorporated herein by reference. Briefly, in such procedures a 
plurality of nucleic acids to be recombined are digested with DNAse to generate 
1 5 fragments having an average size of 50-200 nucleotides. Fragments of the desired 
average size are purified and resuspended in a PCR mixture, PCR is conducted under 
conditions which facilitate recombination between the nucleic acid fragments. For 
example, PCR may be performed by resuspending the purified fragments at a 
concentration of 10-30ng/:l in a solution of 0.2mM of each dNTP. 2.2mM MgC12. 
20 50mM KCL, lOmM Tris HCl. pH 9.0. and 0.1% Triton X-100. 2.5 units of Taq 
polymerase per 100:1 of reaction mixture is added and PCR is performed using the 
following regime: 94** C for 60 seconds, 94** C for 30 seconds. 50-55** C for 30 
seconds, 72° C for 30 seconds (30-45 times) and 72** C for 5 minutes. However, it 
will be appreciated that these parameters may be varied as appropriate. In some 
embodiments, oligonucleotides may be included in the PCR reactions. In other 
embodiments, the Klenow fragment of DNA polymerase I may be used in a first set of 
PCR reactions and Taq polymerase may be used in a subsequent set of PCR reactions. 
Recombinant sequences are isolated and the activities of the polypeptides they encode 
are assessed. 

Variants may also be created by in vivo mutagenesis. In some embodiments, 
random mutations in a sequence of interest are generated by propagating the sequence 
of interest in a bacterial strain, such as an £. coli strain, which carries mutations in 
one or more of the DNA repair pathways. Such "mutator" strains have a higher 
random mutation rate than that of a wild-type parent. Propagating the DNA in one of 
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5 these strains will eventually generate random mutations within the DNA. Mutator 
strains suitable for use for in vivo mutagenesis are described in PCT Publication No. 
WO 91/16427, published October 31, 1991, entitled "Methods for Phenotype Creation 
from Multiple Gene Populations" the disclosure of which is incorporated herein by 
reference m its entirety. 

10 Variants may also be generated using cassette mutagenesis. In cassette 

mutagenesis a small region of a double stranded DNA molecule is replaced with a 
synthetic oligonucleotide "cassette" that differs from the native sequence. The 
oligonucleotide often contains completely and/or partially randomized native 
sequence. 

1 5 Recursive ensemble mutagenesis may also be used to generate variants. 

Recursive ensemble mutagenesis is an algorithm for protein engineering (protein 
mutagenesis) developed to produce diverse populations of phenotypically related 
mutants whose members differ in amino acid sequence. This method uses a feedback 
mechanism to control successive rounds of combinatorial cassette mutagenesis. 

20 Recursive ensemble mutagenesis is described in Arkin, A.P. and Youvan, D.C., 

PNAS, USA, 89:781 1-7815, 1992, the disclosure of which is incorporated herein by 
reference in its entirety. 

In some embodiments, variants are created using exponential ensemble 
mutagenesis. Exponential ensemble mutagenesis is a process for generating 

25 combinatorial libraries with a high percentage of unique and functional mutants, 

wherein small groups of residues are randomized in parallel to identify, at each altered 
position, amino acids which lead to functional proteins. Exponential ensemble 
mutagenesis is described in Delegrave, S. and Youvan, D.C., Biotechnology 
Research, 1 1:1548-1552, 1993, the disclosure of which incorporated herein by 

30 reference in its entirety. Random and site-directed mutagenesis are described in 
Arnold, F.H., Current Opinion in Biotechnology, 4:450-455, 1993, the disclosure of 
which is incorporated herein by reference in its entirety. 
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5 In some embodiments, the variants are created using shuffling procedures 

wherein portions of a plurality of nucleic acids which encode distinct polypeptides are 
fused together to create chimeric nucleic acid sequences which encode chimeric 
polypeptides as described in U.S. Patent No. 5,965,408, filed July 9, 1996, entitled. 
"Method of DNA Reassembly by Interrupting Synthesis", and U.S. Patent No. 
10 5,939,250, filed May 22, 1996, entitled, "Production of Enzymes Having Desired 
Activities by Mutagenesis", both of which are incorporated herein by reference. 

The variants of the polypeptides of Group B amino acid sequences may be 
variants in which one or more of the amino acid residues of the polypeptides of the 
Group B amino acid sequences are substituted with a conserved or non-conserved 
15 amino acid residue (preferably a conserved amino acid residue) and such substituted 
amino acid residue may or may not be one encoded by the genetic code. 

Conservative substitutions are those that substitute a given amino acid in a 
polypeptide by another amino acid of like characteristics. Typically seen as 
conservative substitutions are the following replacements: replacements of an 
20 aliphatic amino acid such as Alanine, Valine, Leucine and Isoleucine with another 
aliphatic amino acid; replacement of a Serine with a Threonine or vice versa; 
replacement of an acidic residue such as Aspartic acid and Glutamic acid with another 
acidic residue; replacement of a residue bearing an amide group, such as Asparagine 
and Glutamine, with another residue bearing an amide group; exchange of a basic 
residue such as Lysine and Arginine with another basic residue; and replacement of 
an aromatic residue such as Phenylalanine, Tyrosine with another aromatic residue. 

Other variants are those in which one or more of the amino acid residues of 
the polypeptides of the Group B amino acid sequences includes a substituent group. 

Still other variants are those in which the polypeptide is associated with 
another compound, such as a compound to increase the half-life of the polypeptide 
(for example, polyethylene glycol). 

Additional variants are those in which additional amino acids are fused to the 
polypeptide, such as a leader sequence, a secretory sequence, a proprotein sequence or 
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5 a sequence which facilitates purification, enrichment, or stabilization of the 
polypeptide. 

In some embodiments, the fragments, derivatives and analogs retain the same 
biological function or activity as the polypeptides of Group B amino acid sequences, 
and sequences substantially identical thereto. In other embodiments, the fragment, 
10 derivative, or analog includes a proprotein, such that the fragment, derivative, or 
analog can be activated by cleavage of the proprotein portion to produce an active 
polypeptide. 

Another aspect of the invention is polypeptides or fragments thereof which 
have at least about 50%, at least about 55%, at least about 60%, at least about 65%, at 

1 5 least about 70%, at least about 75%, at least about 80%, at least about 85%, at least 
about 90%, at least about 95%, or more than about 95% homology to one of the 
polypeptides of Group B amino acid sequences, and sequences substantially identical 
thereto, or a fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 
150 consecutive amino acids thereof. Homology may be determined using any of the 

20 programs described above which aligns the polypeptides or fragments being 

compared and determines the extent of amino acid identity or similarity between 
them. It will be appreciated that amino acid "homology" includes conservative amino 
acid substitutions such as those described above. 

The polypeptides or fragments having homology to one of the polypeptides of 
25 Group B amino acid sequences, and sequences substantially identical thereto, or a 
fragment comprising at least about 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 
consecutive amino acids thereof may be obtained by isolating the nucleic acids 
encoding them using the techniques described above. 

Alternatively, the homologous polypeptides or fragments may be obtained 
30 through biochemical enrichment or purification procedures. The sequence of 

potentially homologous polypeptides or fragments may be determined by proteolytic 
digestion, gel electrophoresis and/or microsequencing. The sequence of the 
prospective homologous polypeptide or firagment can be compared to one of the 
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5 polypeptides of Group B amino acid sequences, and sequences substantially identical 
thereto, or a fragment comprising at least about 5, 10, 15, 20, 25, 30, 35, 40, 50. 75, 
100, or 150 consecutive amino acids thereof using any of the programs described 
above. 

Another aspect of the invention is an assay for identifying fragments or 
10 variants of Group B amino acid sequences, and sequences substantially identical 
thereto, which retain the enzymatic function of the polypeptides of Group B amino 
acid sequences, and sequences substantially identical thereto. For example the 
fragments or variants of said polypeptides, may be used to catalyze biochemical 
reactions, which indicate that the fragment or variant retains the enzymatic activity of 
15 the polypeptides in the Group B amino acid sequences. 

The assay for determining if fragments of variants retain the enzymatic 
activity of the polypeptides of Group B amino acid sequences, and sequences 
substantially identical thereto includes the steps of: contacting the polypeptide 
fragment or variant with a substrate molecule under conditions which allow the 
20 polypeptide fragment or variant to function, and detecting either a decrease in the 
level of substrate or an increase in the level of the specific reaction product of the 
reaction between the polypeptide and substrate. 

The polypeptides of Group B amino acid sequences, and sequences 
substantially identical thereto or fragments comprising at least 5, 10, 15, 20, 25, 30, 
35, 40, 50, 75, 100, or 150 consecutive amino acids thereof may be used in a variety 
of applications. For example, the polypeptides or fragments thereof may be used to 
catalyze biochemical reactions. In accordance with one aspect of the invention, there 
is provided a process for utilizing the polypeptides of Group B amino acid sequences, 
and sequences substantially identical thereto or polynucleotides encoding such 
polypeptides for hydrolyzing glycosidic linkages. In such procedures, a substance 
containing a glycosidic linkage (e.g., a starch) is contacted with one of the 
polypeptides of Group B amino acid sequences, or sequences substantially identical 
thereto under conditions which facilitate the hydrolysis of the glycosidic linkage. 
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5 The polypeptides of Group B amino acid sequences, and sequences 

substantially identical thereto or fragments comprising at least 5, 10, 15, 20, 25, 30, 
35, 40, 50, 75, 100, or 150 consecutive amino acids thereof, may also be used in the 
liquefaction and saccharification of starch. Using the polypeptides or fragments 
thereof of this invention, liquefaction may be carried out at a lower pH than with 
10 previous enzymes. In one embodiment, liquefaction is performed at a pH of 4.5. 

Additionally, the polypeptides or fragments thereof of this invention are less calcium 
dependent than enzymes previously used in these processes. In liquefaction amylases 
are used to hydrolyze starch. In a preferred embodiment, the polypeptides or 
fragments thereof of this invention are thermostable at 90-95**C. 

^ 5 The polypeptides of Group B amino acid sequences, and sequences 

substantially identical thereto or fragments comprising at least 5, 10, 15, 20, 25, 30, 
35, 40, 50, 75, 100, or 150 consecutive amino acids thereof, may also be used to 
generate antibodies which bind specifically to the polypq)tidcs or fragments. The 
resulting antibodies may be used in immunoafiinity chromatography procedures to 

20 isolate or purify the polypeptide or to determine whether the polypeptide is present in 
a biological sample. In such procedures, a protein preparation, such as an extract, or a 
biological sample is contacted with an antibody enable of specifically binding to one 
of the polypeptides of Group B amino acid sequences, and sequences substantially 
identical thereto, or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40. 50, 75, 

25 1 00, or 1 50 consecutive amino acids thereof. 

In immunoaflBnity procedures, the antibody is attached to a solid support, such 
as a bead or other column matrix. The protein preparation is placed in contact with 
the antibody under conditions in which the antibody specifically binds to one of the 
polypeptides of Group B amino acid sequences, and sequences substantially identical 
30 thereto, or fragment thereof After a wash to remove non-specifically bound proteins, 
the specifically bound polypeptides are eluted. 

The ability of proteins in a biological sample to bind to the antibody may be 
determined using any of a variety of procedures familiar to those skilled in the art. 
For example, binding may be determined by labeling the antibody with a detectable 
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5 label such as a fluorescent agent, an enzymatic label, or a radioisotope. Alternatively, 
binding of the antibody to the sample may be detected using a secondary antibody 
having such a detectable label thereon. Particular assays include ELISA assays, 
sandwich assays, radioimmunoassays, and Western Blots. 

Polyclonal antibodies generated against the polypeptides of Group B amino 
10 acid sequences, and sequences substantially identical thereto, or fragments comprising 
at least 5, 10, 15. 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids 
thereof can be obtained by direct injection of the polypeptides into an animal or by 
administering the polypeptides to an animal, for example, a nonhuman. The antibody 
so obtained will then bind the polypeptide itself In this manner, even a sequence 
15 encoding only a fragment of the polypeptide can be used to generate antibodies which 
may bind to the whole native polypeptide. Such antibodies can then be used to isolate 
the polypeptide from cells expressing that polypeptide. 

For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include 

20 the hybridoma technique (Kohler and Milstein, Nature, 256:495-497, 1975, the 
disclosure of which is incorporated herein by reference), the trioma technique, the 
human B-cell hybridoma technique (Kozbor et al.. Immunology Today 4:72, 1983, 
the disclosure of which is incorporated herein by reference), and the EBV-hybridoma 
technique (Cole, et al., 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. 

25 Liss, Inc., pp. 77-96, the disclosure of which is incorporated herein by reference). 

Techniques described for the production of single chain antibodies (U.S. 
Patent No. 4,946,778, the disclosure of which is incorporated herein by reference) can 
be adapted to produce single chain antibodies to the polypeptides of Group B amino 
acid sequences, and sequences substantially identical thereto, or fragments comprising 
30 at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids 

thereof Alternatively, transgenic mice may be used to express humanized antibodies 
to these polypeptides or fragments thereof 
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5 Antibodies generated against the polypeptides of Groiq) B amino acid 

sequences, and sequences substantially identical thereto, or ftagments comprising at 
least 5, 10, 15, 20, 25, 30, 35, 40, 50. 75, 100, or 150 consecutive amino acids thereof 
may be used in screening for similar polypqjtides fiom oUier organisms and samples. 
In such techniques, polypeptides fiom the organism are contacted with the antibody 
10 and those polypeptides which specifically bind the antibody are detected. Any of the 
procedures described above may be used to detect antibody binding. One such 
screening assay is described in "Methods for Measuring Cellulase Activities", 
Methods in Enzymology, Vol 160, pp. 87-1 16, which is hereby incorporated by 
reference in its entirety. 

1 5 As used herein the term "nucleic acid sequoice as set forth in SEQ ID Nos.: 1 , 

3. 5, 9, 11,13, 15. 17, 19, 21, 23, 25, 27, 29. 31, 33, 35, 37, 39. 41. 43. 45 ,47, 49. 51. 
53. 55, 57. 59. 61. 63. 65, 67, 69. 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91. 93, 95. 97, 
99. 101. 103. 105. 107, 109, 111, 113, 115, 117, 119, 121. 123, 125, 127, 129, 131, 
133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153. 155. 157, 159, 161, 163, 165. 

20 167, 169, 171, 173, 175. 177. 179. 181. 183, 185, 187. 189. 191. 193. 195. 197, 199. 
201, 203, 205, 207, 209. 21 1. 213. 215. 217, 219, 221, 223, 225. 227, 229. 231. 233. 
235, 237, 239. 241, 243, 245. 247. 249. 251, 253. 255. 257. 259, 261, 263. 265, 267, 
269, 271. 273. 275, 277, 279. 281, 283, 285, 287, 289, 291, 293, 295, 297. 299" 
encompasses the nucleotide sequences of Group A nucleic acid sequences, and 

25 sequences substantially identical thereto, as well as sequences homologous to Group 
A nucleic acid sequences, and fragments thereof and sequences complementary to all 
of the preceding sequences. The Augments include portions of SEQ ID Nos.: 1, 3, 5, 
9, 11. 13, 15, 17, 19. 21. 23. 25. 27. 29. 31. 33, 35, 37, 39, 41, 43, 45 .47, 49, 51, 53, 
55, 57. 59. 61. 63. 65. 67. 69. 71. 73, 75, 77, 79, 81, 83, 85. 87, 89, 91, 93. 95. 97, 99, 

30 101, 103, 105. 107, 109, 111, 113. 115, 117, 119, 121, 123, 125. 127, 129. 131, 133. 
135, 137, 139. 141, 143, 145. 147, 149. 151, 153, 155, 157, 159, 161, 163, 165, 167, 
169, 171, 173, 175. 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199.201, 
203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 
237, 239, 241, 243, 245, 247, 249, 251. 253, 255, 257. 259, 261, 263, 265, 267. 269, 

35 271, 273, 275, 277, 279, 281, 283. 285. 287. 289. 291. 293, 295, 297. and 299. 
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5 comprising at least 1 0, 1 5, 20, 25, 30, 35, 40, 50, 75, 100. 1 50, 200, 300. 400. or 500 
consecutive nucleotides of Group A nucleic acid sequences, and sequences 
substantially identical thereto. Homologous sequences and fiagments of Group A 
nucleic acid sequences, and sequoices substantially identical thereto, refer to a 
sequence having at least 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, 75%, 70%, 

1 0 65%, 60%, 55% or 50% homology to these sequences. Homology may be determined 
using any of the computer programs and parametos described herein, including 
FASTA version 3.0t78 with the default parameters. Homologous sequences also 
include RNA sequences in which uridines replace the thymines in the nucleic acid 
sequences as set forth in the Group A nucleic acid sequences. The homologous 

1 5 sequences may be obtained using any of the procedures described herein or may result 
from the correction of a sequencing error. It will be appreciated that the nucleic acid 
sequences as set forth in Group A nucleic acid sequences, and sequences substantially 
identical thereto, can be represented in the traditional single character format (See the 
inside back cover of Stryer, Lubert. Biochemistry, 3rd Ed., W. H Freeman & Co., 

20 New Yoric.) or in any other format \s*ich records the identity of the nucleotides in a 
sequence. 

As used herein the term "a polypq)tide sequence as set forth in SEQ ID Nos: 
2, 4, 6, 10, 12, 14, 16, 18. 20. 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44. 46. 48, 50, 
52, 54, 56. 58. 60. 62. 64. 66, 68, 70, 72, 74, 76, 78, 80, 82. 84, 86, 88. 90, 92. 94. 96. 

25 98, 100. 102, 104, 106, 108. 110. 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130. 
132. 134, 136. 138, 140. 142, 144. 146. 148. 150. 152. 154, 156, 158. 160, 162. 164, 
166, 168, 170, 172, 174, 176. 178, 180, 182, 184. 186, 188. 190. 192, 194, 196, 198, 
202, 204. 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226. 228, 230, 232, 234. 
236, 238. 240. 242, 244, 246, 248, 250. 252, 254, 256, 258, 260, 262, 264, 266, 268, 

30 270, 272, 274, 276, 278, 280, 282, 284, 286, 288. 290, 292, 294, 296, 298" 
encompasses the polypq)tide sequence of Group B amino acid sequences, and 
sequences substantially identical thereto, which are encoded by a sequence as set forth 
in SEQ ID Nos: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22. 24, 26, 28. 30, 32, 34, 36, 38, 40, 
42. 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 

35 88, 90, 92, 94. 96, 98, 100, 102, 104. 106, 108, 1 10, 1 12, 1 14. 1 16. 1 18. 120, 122, 
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5 124. 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 
158, 160, 162, 164, 166. 168, 170, 172, 174, 176, 178, 180, 182. 184, 186, 188, 190, 
192. 194, 196, 198, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220. 222. 224, 226, 
228, 230, 232, 234, 236, 238, 240, 242. 244, 246, 248, 250, 252, 254, 256, 258, 260, 
262, 264, 266, 268. 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292. 294, 

1 0 296, 298, polypeptide sequences homologous to the polypeptides of Group B amino 
acid sequences, and sequences substantially identical thereto, or fragments of any of 
the preceding sequences. Homologous polypeptide sequences refer to a polypeptide 
sequence having at least 99%, 98%, 97%, 96%, 95%, 90%. 85%, 80%, 75%, 70%, 
65%, 60%, 55% or 50% homology to one of the polypeptide sequences of the Group 

1 5 B amino acid sequences. Homology may be determined using any of the computer 
programs and parameters described herein, including FASTA version 3.0t78 with the 
default parameters or with any modified parameters. The homologous sequences may 
be obtained using any of the procedures described herein or may result fix)m the 
correction of a sequencing error. The polypeptide fragments comprise at least 5, 10, 

20 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of the polypeptides 
of Group B amino acid sequences, and sequences substantially identical thereto. It 
will be appreciated that the polypeptide codes as set forth in Group B amino acid 
sequences, and sequences substantially identical thereto, can be represented in the 
traditional single character fonnat or three letter format (See the inside back cover of 

25 Stryer, Lubert. Biochemistry, 3rd Ed., W. H Freeman & Co., New York.) or in any 
other fonnat which relates the identity of the polypeptides in a sequence. 

It will be appreciated by those skilled in the art that a nucleic acid sequence as 
set forth in SEQ ID No.s: 1, 3. 5. 9, 11, 13. 15. 17. 19, 21, 23, 25, 27, 29, 31, 33, 35, 
37, 39, 41, 43. 45 ,47, 49. 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 
30 83, 85, 87, 89. 91, 93, 95, 97, 99, 101. 103, 105, 107, 109, 11 1, 1 13, 1 15. 1 17, 1 19, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143. 145, 147, 149, 151, 153. 
155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 
189, 191, 193, 195. 197, 199, 201, 203, 205, 207. 209, 211, 213, 215, 217, 219, 221, 
^ 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 
35 257, 259, 261, 263, 265, 267. 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 
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5 291, 293, 295, 297, 299 and a polypeptide sequence as set forth in SEQ ID No.s: 2, 4, 
6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50. 52, 
54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88. 90, 92, 94, 96, 98, 
100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120. 122, 124. 126. 128. 130. 132, 
134, 136, 138, 140, 142. 144, 146, 148. 150. 152. 154. 156, 158. 160. 162. 164. 166. 
10 168, 170, 172, 174, 176, 178, 180. 182. 184. 186, 188, 190. 192, 194. 196, 198, 202, 
204, 206, 208, 210, 212. 214, 216, 218, 220, 222, 224. 226. 228. 230. 232, 234. 236. 
238. 240. 242. 244. 246, 248, 250. 252. 254, 256, 258. 260, 262, 264, 266, 268, 270, 
272. 274, 276, 278. 280. 282. 284, 286. 288, 290, 292. 294. 296, 298 can be stored, 
recorded, and manipulated on any medium which can be read and accessed by a 
15 computer. As used herein, the words •^recorded" and "stored" refer to a process for 
storing information on a computer medium. A skilled artisan can readily adopt any of 
the presently known methods for recording information on a computer readable 
medium to generate manufactures comprising one or more of the nucleic acid 
sequences as set forth in Group A nucleic acid sequences, and sequences substantially 
20 identical thereto, one or more of the polypeptide sequences as set forth in Group B 
amino acid sequences, and sequences substantially identical thereto. Another aspect 
of the invention is a computer readable medium having recorded thereon at least 2, 5, 
10. 15, or 20 nucleic acid sequences as set forth in Group A nucleic acid sequences, 
and sequences substantially identical thereto. 

Another aspect of the invention is a computer readable medium having 
recorded thereon one or more of the nucleic acid sequences as set forth in Group A 
nucleic acid sequences, and sequences substantially identical thereto. Another aspect 
of the invention is a computer readable medium having recorded thereon one or more 
of the polypeptide sequences as set forth in Group B amino acid sequences, and 
sequences substantially identical thereto. Another aspect of the invention is a 
computer readable medium having recorded thereon at least 2. 5, 10, 15, or 20 of the 
sequences as set forth above. 



Computer readable media include magnetically readable media, optically 
readable media, electronically readable media and magnetic/optical media. For 
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5 example, the computer readable media may be a hard disk, a floppy disk, a magnetic 
tape, CD-ROM, Digital Versatile Disk (DVD), Random Access Memory (RAM), or 
Read Only Memory (ROM) as well as other types of other media known to those 
skilled in the art. 

Embodiments of the invention include systems (e.g., internet based systems), 
10 particularly computer systems which store and manipulate the sequence information 
described herein. One example of a computer system 100 is illustrated in block 
diagram form in Figure 1. As used herein, "a computer system" refers to the 
hardware components, software components, and data storage components used to 
analyze a nucleotide sequence of a nucleic acid sequence as set forth in Group A 
15 nucleic acid sequences, and sequences substantially identical thereto, or a polypeptide 
sequence as set forth in the Group B amino acid sequences. The computer system 100 
typically includes a processor for processing, accessing and manipulating the 
sequence data. The processor 105 can be any well-known type of central processing 
unit, such as, for example, the Pentium III from Intel Corporation, or similar 
20 processor from Sun, Motorola, Compaq, AMD or International Business Machines. 

Typically the computer system 100 is a general purpose system that comprises 
the processor 105 and one or more internal data storage components 110 for storing 
data, and one or more data retrieving devices for retrieving the data stored on the data 
storage components. A skilled artisan can readily appreciate that any one of the 
25 currently available computer systems are suitable. 

In one particular embodiment, the computer system 100 includes a processor 
105 connected to a bus which is connected to a main memory 1 1 5 (preferably 
implemented as RAM) and one or more internal data storage devices 1 10, such as a 
hard drive and/or other computer readable media having data recorded thereon. In 
30 some embodiments, the computer system 100 further includes one or more data 
retrieving device 1 1 8 for reading the data stored on the internal data storage devices 
110. 
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5 The data retrieving device 1 1 8 may represent, for example, a floppy disk 

drive, a compact disk drive, a magnetic tape drive, or a modem capable of comiection 
to a remote data storage system (e.g., via the internet) etc. In some embodiments, the 
internal data storage device 1 10 is a removable computer readable medium such as a 
floppy disk, a compact disk, a magnetic tape, etc. containing control logic and/or data 
10 recorded thereon. The computer system 100 may advantageously include or be 
programmed by appropriate software for reading the control logic and/or the data 
from the data storage component once mserted in the data retrieving device. 

The computer system 100 includes a display 120 which is used to display 
output to a computer user. It should also be noted that the computer system 100 can 
1 5 be linked to other computer systems 125a-c in a network or wide area network to 
provide centralized access to the computer system 100. 

Software for accessing and processing the nucleotide sequences of a nucleic 
acid sequence as set forth in Group A nucleic acid sequences, and sequences 
substantially identical thereto, or a polypeptide sequence as set forth in Group B 
20 amino acid sequences, and sequences substantially identical thereto, (such as search 
tools, compare tools, and modeling tools etc.) may reside in main memory 115 during 
execution. 

In some embodiments, the computer system 100 may fiirther comprise a 
sequence comparison algorithm for comparing a nucleic acid sequence as set forth in 

25 Group A nucleic acid sequences, and sequences substantially identical thereto, or a 
polypeptide sequence as set forth in Group B amino acid sequences, and sequences 
substantially identical thereto, stored on a computer readable medium to a reference 
nucleotide or polypeptide sequence(s) stored on a computer readable medium. A 
^'sequence comparison algorithm" refers to one or more programs which are 

30 implemented (locally or remotely) on the computer system 100 to compare a 

nucleotide sequence with other nucleotide sequences and/or compounds stored within 
a data storage means. For example, the sequence comparison algorithm may compare 
the nucleotide sequences of a nucleic acid sequence as set forth in Group A nucleic 
acid sequences, and sequences substantially identical thereto, or a polypeptide 



wo 02/068589 



PCT/US02/05068 



75 

5 sequence as set forth in Group B amino acid sequences, and sequences substantially 
identical thereto, stored on a computer readable medium to reference sequences stored 
on a computer readable medium to identify homologies or structural motifs. Various 
sequence comparison programs identified elsewhere in this patent specification are 
particularly contemplated for use in this aspect of the invention. Protein and/or 

1 0 nucleic acid sequence homologies may be evaluated using any of the variety of 

sequence comparison algorithms and programs known in the art. Such algorithms and 
programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, 
TFASTA, and CLUSTALW (Pearson and Lipman, Proc. Natl. Acad. Sci. USA 
85(8):2444.2448, 1988; Altschul et al., J. Mol. Biol. 215(3):403-410, 1990; 

1 5 Thompson et al. Nucleic Acids Res. 22(2):4673-4680, 1994; Higgins et al., Methods 
Bnzymol. 266:383-402, 1996; Altschul et al., J. Mol. Biol. 215(3):403-410, 1990; 
Altschul et al, Nature Genetics 3:266-272, 1993). 

Homology or identity is often measured using sequence analysis software 
(e.g.. Sequence Analysis Software Package of the Genetics Computer Group, 

20 University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, 
WI 53705). Such software matches similar sequences by assigning degrees of 
homology to various deletions, substitutions and other modifications. The terms 
"homology*' and "identity" in the context of two or more nucleic acids or polypeptide 
sequences, refer to two or more sequences or subsequences that are the same or have a 

25 specified percentage of amino acid residues or nucleotides that are the same when 
compared and aligned for maximum correspondence over a comparison window or 
designated region as measured using any number of sequence comparison algorithms 
or by manual alignment and visual inspection. 

For sequence comparison, typically one sequence acts as a reference sequence, 
30 to which test sequences are compared. When usmg a sequence comparison algorithm, 
test and reference sequences are entered into a computer, subsequence coordinates are 
designated, if necessary, and sequence algorithm program parameters are designated. 
Default program parameters can be used, or alternative parameters can be designated. 
The sequence comparison algorithm then calculates the percent sequence identities for 
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5 the test sequences relative to the reference sequence, based on the program 
parameters. 

A "comparison window'*, as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in 

10 which a sequence may be compared to a reference sequence of the same number of 
contiguous positions after the two sequences are optimally aligned. Methods of 
alignment of sequence for comparison are well-known in the art. Optimal alignment 
of sequences for comparison can be conducted, e.g., by the local homology algorithm 
of Smith & Waterman, Adv. Appl. Math. 2:482, 1981, by the homology alignment 

15 algorithm of Needleman & Wunsch, J. Mol. Biol 48:443, 1970, by the search for 

similarity method of person & Lipman, Proc. Nat'l Acad. Sci. USA 85:2444, 1988, by 
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and 
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 
575 Science Dr., Madison, WI), or by manual alignment and visual inspection. Other 

20 algorithms for determining homology or identity include, for example, in addition to a 
BLAST program (Basic Local Alignment Search Tool at the National Center for 
Biological Information), ALIGN, AMAS (Analysis of Multiply Aligned Sequences), 
AMPS (Protein Multiple Sequence Alignment), ASSET (Aligned Segment Statistical 
Evaluation Tool), BANDS, BESTSCOR, BIOSCAN (Biological Sequence 

25 Comparative Analysis Node), BLIMPS (BLocks IMProved Searcher), FASTA, 
Intervals & Points, BMB, CLUSTAL V, CLUSTAL W, CONSENSUS, 
LCONSENSUS, WCONSENSUS, Smith-Waterman algorithm, DARWIN, Las Vegas 
algorithm, FNAT (Forced Nucleotide Alignment Tool), Framealign, Framesearch, 
DYNAMIC, FILTER, FSAP (Fristensky Sequence Analysis Package), GAP (Global 

30 Alignment Program), GENAL, GIBBS, GenQuest, ISSC (Sensitive Sequence 

Comparison), LALIGN (Local Sequence Alignment), LCP (Local Content Program), 
MACAW (Multiple Alignment Construction & Analysis Workbench), MAP 
(Multiple Alignment Program), MBLKP, MBLKN, PIMA (Pattern-Induced Multi- 
sequence Alignment), SAGA (Sequence Alignment by Genetic Algorithm) and 

35 WHAT-IF. Such alignment programs can also be used to screen genome databases to 
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5 identify polynucleotide sequences having substantially identical sequences. A number 
of genome databases are available, for example, a substantial portion of the human 
genome is available as part of the Human Genome Sequencing Project (J. Roach, 
http://weber.u. Washington.edu/-roach/human_ genome^ progress 2.html) (Gibbs, 
1995). At least twenty-one other genomes have akeady been sequenced, including, 

10 for example, M. genitalium (Eraser et al, 1 995), M. jannaschii (Bult et al., 1 996), H. 
influenzae (Fleischmann et al, 1995), E. coli (Blattner et al„ 1997), and yeast (S. 
cerevisiae) (Mcwes et al., 1997), and D. melanogaster (Adams et al., 2000). 
Significant progress has also been made in sequencing the genomes of model 
organism, such as mouse, C. elegans, and Arabadopsis sp. Several databases 

1 5 containing genomic information annotated with some functional information are 

maintained by different organization, and are accessible via the internet, for example, 
http://wwwtigr.org/tdb; http://www.genetics.wisc.edu; http://genome- 
www.stanford.edu/-ball; http://hiv-web.lanl.gov; http://www.ncbi.nlm.mh.gov; 
http://www.ebi.ac.uk; http://Pasteur.fiyother/biology; and http:// 

20 www.genome.wi.mit.edu. 

One example of a useful algorithm is BLAST and BLAST 2.0 algorithms, 
which are described in Altschul et al., Nuc. Acids Res. 25:3389-3402, 1977, and 
Altschul et al., J. Mol. Biol 215:403-410, 1990, respectively. Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves 
first identifying high scoring sequence pairs (HSPs) by identifying short words of 
length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et al, 
supra). These initial neighborhood word hits act as seeds for initiating searches to 
find longer HSPs containing them. The word hits are extended in both directions 
along each sequence for as far as the cumulative alignment score can be increased. 
Cumulative scores are calculated using, for nucleotide sequences, the parameters M 
(reward score for a pair of matching residues; always >0). For amino acid sequences, 
a scoring matrix is used to calculate the cumulative score. Extension of the word hits 
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5 in each direction are halted when: the cumulative alignment score falls offby the 
quantity X from its maximum achieved value; the cumulative score goes to zero or 
below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T, and 
X determine the sensitivity and speed of the alignment. The BLASTN program (for 

10 nucleotide sequences) uses as defaults a wordlengfli (W) of 1 1, an expectation (E) of 
10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength of 3, and expectations (E) of 10, and 
the BLOSUM62 scoring matrix (see Henikoff & HenikoflF, Proc. Natl. Acad. Sci. 
USA 89:10915, 1989) alignments (B) of 50, expectation (E) of 10, M=5, N= -4, and a 

1 5 comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Natl. Acad. Sci. USA 
90:5873, 1993). One measure of similarity provided by BLAST algorithm is the 
smallest sum probability (P(N)), which provides an indication of the probability by 
20 which a match between two nucleotide or amino acid sequences would occur by 

chance. For example, a nucleic acid is considered similar to a references sequence if 
the smallest sum probability in a comparison of the test nucleic acid to the reference 
nucleic acid is less than about 0.2, more preferably less than about 0.01, and most 
preferably less than about 0.001. 

25 In one embodiment, protein and nucleic acid sequence homologies are 

evaluated using the Basic Local Alignment Search Tool ("BLAST") In particular, five 
specific BLAST programs are used to perform the following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence 
against a protein sequence database; 



30 



(2) BLASTN compares a nucleotide query sequence against a 
nucleotide sequence database; 
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5 (3) BLASTX compares the six-frame conceptual translation 

products of a query nucleotide sequence (both strands) against a protein sequence 
database; 

(4) TBLASTN compares a query protein sequence against a 
nucleotide sequence database translated in all six reading frames (both strands); and 

^ ^ (5) TBLASTX compares the six-frame translations of a nucleotide 

query sequence against the six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar 
segments, which are referred to herein as "high-scoring segment pairs " between a 
query amino or nucleic acid sequence and a test sequence which is preferably 

1 5 obtained from a protein or nucleic acid sequence database. High-scoring segment 
pairs are preferably identified (i.e., aligned) by means of a scoring matrix, many of 
which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 
matrix (Gonnet et al„ Science 256:1443-1445, 1992; Henikoffand Henikoff, Proteins 
17:49-61, 1993). Less preferably, the PAM or PAM250 matrices may also be used 

20 (see, e.g., Schwartz and Da>4ioflF, eds., 1978, Matrices for Detecting Distance 
Relationships: Atlas ofProtein Sequence and Structure, Washington: National 
Biomedical Research Foundation). BLAST programs are accessible through the U.S. 
National Library of Medicine, e.g., at www.ncbi.nlm.nih.gov. 

The parameters used with the above algorithms may be adapted depending on 
25 the sequence length and degree of homology studied. In some embodiments, the 
parameters may be the default parameters used by the algorithms in the absence of 
instructions from the user. 

Figure 2 is a flow diagram illustrating one embodiment of a process 200 for 
comparing a new nucleotide or protein sequence with a database of sequences in order 
30 to determine the homology levels between the new sequence and the sequences in the 
database. The database of sequences can be a private database stored within the 
computer system 100, or a public database such as GENBANK that is available 
through the Internet. 
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5 The process 200 begins at a start state 201 and then moves to a state 202 

wherein the new sequence to be compared is stored to a memory in a computer 
system 100. As discussed above, the memory could be any type of memory, 
including RAM or an internal storage device. 

The process 200 then moves to a state 204 wherein a database of sequences is 
10 opened for analysis and comparison. The process 200 then moves to a state 206 
wherein the first sequence stored in the database is read into a memory on the 
computer. A comparison is then perfomed at a state 210 to determine if the first 
sequence is the same as the second sequence. It is important to note diat this step is 
not limited to performing an exact comparison between the new sequence and the first 
1 5 sequence in the database. Well-known methods are known to those of skill in the art 
for comparing two nucleotide or protein sequences, even if they are not identical. For 
example, gaps can be introduced into one sequence in order to raise the homology 
level between the two tested sequences. The parameters that control whether gaps or 
other features are introduced into a sequence during comparison are normally entered 
20 by the user of the computer system. 

Once a comparison of the two sequences has been performed at the state 210, 
a determination is made at a decision state 210 whether the two sequences are the 
same. Of course, the term "same" is not limited to sequences that are absolutely 
identical. Sequences that are within the homology parameters entered by the user will 
25 be marked as "same" in the process 200. 

If a determination is made that the two sequences are the same, the process 
200 moves to a state 214 wherein the name of the sequence from the database is 
displayed to the user. This state notifies the user that the sequence with the displayed 
name fiilfills the homology constraints that were entered. Once the name of the stored 
30 sequence is displayed to the user, the process 200 moves to a decision state 218 

wherein a determination is made whether more sequences exist in the database. If no 
more sequences exist in the database, then the process 200 terminates at an end state 
220. However, if more sequences do exist in the database, then the process 200 
moves to a state 224 wherein a pointer is moved to the next sequence in the database 
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5 so that it can be compared to the new sequence. In this manner, the new sequence is 
ahgned and compared with every sequence in the database. 

It should be noted that if a determination had been made at the decision state 
212 that the sequences were not homologous, then the process 200 would move 
immediately to the decision state 218 in order to determine if any other sequences 
1 0 were available in the database for comparison. 

Accordingly, one aspect of the invention is a computer system comprising a 
processor, a data storage device having stored thereon a nucleic acid sequence as set 
forth in Group A nucleic acid sequences, and sequences substantially identical 
thereto, or a polypq)tide sequence as set forth in Group B amino acid sequences, and 

1 5 sequences substantially identical thereto, a data storage device having retrievably 
stored thereon reference nucleotide sequences or polypeptide sequences to be 
compared to a nucleic acid sequence as set forth in Group A nucleic acid sequences, 
and sequences substantially identical thereto, or a polypeptide sequence as set forth in 
Group B amino acid sequences, and sequences substantially identical thereto, and a 

20 sequence comparer for conducting the comparison. The sequence comparer may 
indicate a homology level between the sequences compared or identify structural 
motifs in the above described nucleic acid code of Group A nucleic acid sequences, 
and sequences substantially identical thereto, or a polypeptide sequence as set forth in 
Group B amino acid sequences, and sequences substantially identical thereto, or it 

25 may identify structural motifs in sequences which are compared to these nucleic acid 
codes and polypeptide codes. In some embodiments, the data storage device may 
have stored thereon the sequences of at least 2, 5, 10, 15, 20, 25, 30 or 40 or more of 
the nucleic acid sequences as set forth in Group A nucleic acid sequences, and 
sequences substantially identical thereto, or the polypeptide sequences as set forth in 

30 Group B amino acid sequences, and sequences substantially identical thereto. 

Another aspect of the invention is a method for determining the level of 
homology between a nucleic acid sequence as set forth in Group A nucleic acid 
sequences, and sequences substantially identical thereto, or a polypeptide sequence as 
set forth in Group B amino acid sequences, and sequences substantially identical 
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5 thereto, and a reference nucleotide sequence. The method including reading the 
nucleic acid code or the polypeptide code and the reference nucleotide or polypeptide 
sequence through the use of a computer program which determines homology levels 
and determining homology between the nucleic acid code or polypeptide code and the 
reference nucleotide or polypeptide sequence with the computer program. The 

1 0 computer program may be any of a number of computer programs for determining 
homology levels, including those specifically enumerated herein, (e.g., BLAST2N 
with the default parameters or with any modified parameters). The method may be 
implemented using the computer systems described above. The method may also be 
performed by reading at least 2, 5, 10, 15, 20, 25, 30 or 40 or more of the above 

15 described nucleic acid sequences as set forth in the Group A nucleic acid sequences, 
or the polypeptide sequences as set forth in the Group B amino acid sequences 
through use of the computer program and determining homology between the nucleic 
acid codes or polypeptide codes and reference nucleotide sequences or polypeptide 
sequences. 

20 Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a 

computer for determining whether two sequences are homologous. The process 250 
begins at a start state 252 and then moves to a state 254 wherein a first sequence to be 
compared is stored to a memory. The second sequence to be compared is then stored 
to a memory at a state 256. The process 250 then moves to a state 260 wherein the 

25 first character in the first sequence is read and then to a state 262 wherein the first 
character of the second sequence is read. It should be understood that if the sequence 
is a nucleotide sequence, then the character would normally be either A, T, C, G or U. 
If the sequence is a protein sequence, then it is preferably in the single letter amino 
acid code so that the first and sequence sequences can be easily compared. 

30 A determination is then made at a decision state 264 whether the two 

characters are the same. If they are the same, then the process 250 moves to a state 
268 wherein the next characters in the first and second sequences are read. A 
determination is then made whether the next characters are the same. If they are, then 
the process 250 continues this loop imtil two characters are not the same. If a 
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5 determination is made that the next two characters are not the same, the process 250 
moves to a decision state 274 to detennine whether there arc any more characters 
either sequence to read. 

If there are not any more characters to read, then the process 250 moves to a 
state 276 wherein the level of homology between the first and second sequences is 
10 displayed to the user. The level of homology is determined by calculating the 

proportion of characters between the sequences that were the same out of the total 
number of sequences in the first sequence. Thus, if every character in a first 100 
nucleotide sequence aligned with a every character in a second sequence, the 
homology level would be 1 00%. 

1 5 Alternatively, the computer program may be a computer program which 

compares the nucleotide sequences of a nucleic acid sequence as set forth in the 
invention, to one or more reference nucleotide sequences in order to determine 
whether the nucleic acid code of Group A nucleic acid sequences, and sequences 
substantially identical thereto, differs from a reference nucleic acid sequence at one or 

20 more positions. Optionally such a program records the length and identity of inserted, 
deleted or substituted nucleotides with respect to the sequence of either the reference 
polynucleotide or a nucleic acid sequence as set forth in Group A nucleic acid 
sequences, and sequences substantially identical thereto. In one embodiment, the 
computer program may be a program which determines whether a nucleic acid 

25 sequence as set forth in Group A nucleic acid sequences, and sequences substantially 
identical thereto, contains a single nucleotide polymorphism (SNP) with respect to a 
reference nucleotide sequence. 

Accordingly, another aspect of the invention is a method for determining 
whether a nucleic acid sequence as set forth in Group A nucleic acid sequences, and 
30 sequences substantially identical thereto, differs at one or more nucleotides from a 
reference nucleotide sequence comprising the steps of reading the nucleic acid code 
and the reference nucleotide sequence through use of a computer program which 
identifies differences between nucleic acid sequences and identifying differences 
between the nucleic acid code and the reference nucleotide sequence with the 
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5 computer program. In some embodiments, the computer program is a program which 
identifies single nucleotide polymorphisms. The method may be implemented by the 
computer systems described above and the method illustrated in Figure 3. The method 
may also be performed by reading at least 2, 5, 10, 15, 20, 25, 30, or 40 or more of the 
nucleic acid sequences as set forth in Group A nucleic acid sequences, and sequences 
1 0 substantially identical thereto, and the reference nucleotide sequences through the use 
of the computer program and identifying differences between the nucleic acid codes 
and the reference nucleotide sequences with the computer program. 

In other embodiments the computer based system may further comprise an 
identifier for identifying features within a nucleic acid sequence as set forth in the 
1 5 Group A nucleic acid sequences or a polypeptide sequence as set forth in Group B 
amino acid sequences, and sequences substantially identical thereto. 

An "identifier" refers to one or more programs which identifies certain 
features within a nucleic acid sequence as set forth in Group A nucleic acid 
sequences, and sequences substantially identical thereto, or a polypeptide sequence as 
20 set forth in Group B amino acid sequences, and sequences substantially identical 

thereto. In one embodiment, the identifier may comprise a program which identifies 
an open reading fi-ame in a nucleic acid sequence as set forth in Group A nucleic acid 
sequences, and sequences substantially identical thereto. 

Figure 5 is a flow diagram illustrating one embodiment of an identifier process 
25 300 for detecting the presence of a feature in a sequence. The process 300 begins at a 
start state 302 and then moves to a state 304 wherein a first sequence that is to be 
checked for features is stored to a memory 1 15 in the computer system 100. The 
process 300 then moves to a state 306 wherein a database of sequence features is 
opened. Such a database would include a list of each feature's attributes along with 
30 the name of the feature. For example, a feature name could be "Initiation Codon'* and 
the attribute would be "ATG**. Another example would be the feature name 
*TAATAA Box" and the feature attribute would be "TAATAA". An example of 
such a database is produced by the University of Wisconsin Genetics Computer 
Group (www.gcg.com). Alternatively, the features may be structural polypeptide 
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5 motifs such as alpha helices, beta sheets, or functional polypeptide motifs such as 
enzymatic active sites, helix-tum-helix motifs or other motifs known to those skilled 
in the art. 

Once the database of features is opened at the state 306, the process 300 
moves to a state 308 wherein the first feature is read from the database. A 
10 comparison of the attribute of the first feature with the firet sequence is then made at a 
state 310. A determination is then made at a decision state 316 whether the attribute 
of the feature was found in the first sequence. If the attribute was found, then the 
process 300 moves to a state 318 wherein the name of the found feature is displayed 
to the user. 

1 5 The process 300 then moves to a decision state 320 wherein a determination is 

made whether move features exist in the database. If no more features do exist, then 
the process 300 terminates at an end state 324. However, if more features do exist in 
the database, then the process 300 reads the next sequence feature at a state 326 and 
loops back to the state 310 wherein the attribute of the next feature is compared 

20 against the furst sequence. 

It should be noted, that if the feature attribute is not found in the first sequence 
at the decision state 316, the process 300 moves directly to the decision state 320 in 
order to determine if any more features exist in the database. 

Accordingly, another aspect of the invention is a method of identifying a 
25 feature within a nucleic acid sequence as set forth in Group A nucleic acid sequences, 
and sequences substantially identical thereto, or a polypeptide sequence as set forth in 
Group B amino acid sequences, and sequences substantially identical thereto, 
comprising reading the nucleic acid code(s) or polypeptide code(s) through the use of 
a computer program which identifies features therein and identifying features within 
30 the nucleic acid code(s) with the computer program. In one embodiment, computer 
program comprises a computer program which identifies open reading firames. The 
method may be performed by reading a single sequence or at least 2, 5, 10, 15, 20, 25, 
30, or 40 of the nucleic acid sequences as set forth in Group A nucleic acid sequences. 
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5 and sequences substantially identical thereto, or the polypeptide sequences as set forth 
in Group B amino acid sequences, and sequences substantially identical thereto, 
through the use of the computer program and identifying features within the nucleic 
acid codes or polypeptide codes with the computer program. 

A nucleic acid sequence as set forth in Group A nucleic acid sequences, and 

10 sequences substantially identical thereto, or a polypeptide sequence as set forth in 
Group B amino acid sequences, and sequences substantially identical thereto, may be 
stored and manipulated in a variety of data processor programs in a variety of formats. 
For example, a nucleic acid sequence as set forth in Group A nucleic acid sequences, 
and sequences substantially identical thereto, or a polypeptide sequence as set forth in 

15 Group B amino acid sequences, and sequences substantially identical thereto, may be 
stored as text in a word processing file, such as MicrosoftWORD or 
WORDPERFECT or as an ASCII file in a variety of database programs familiar to 
those of skill in the art, such as DB2, SYBASE, or ORACLE. In addition, many 
computer programs and databases may be used as sequence comparison algorithms, 

20 identifiers, or sources of reference nucleotide sequences or polypeptide sequences to 
be compared to a nucleic acid sequence as set forth in Group A nucleic acid 
sequences, and sequences substantially identical thereto, or a polypeptide sequence as 
set forth in Group B amino acid sequences, and sequences substantially identical 
thereto. The following list is intended not to limit the invention but to provide 

25 guidance to programs and databases which are useful with the nucleic acid sequences 
as set forth in Group A nucleic acid sequences, and sequences substantially identical 
thereto, or the polypeptide sequences as set forth in Group B amino acid sequences, 
and sequences substantially identical thereto. 

The programs and databases which may be used include, but are not limited 
30 to: MacPattem (EMBL), DiscoveryBase (Molecular Applications Group), GeneMine 
(Molecular Applications Group), Look (Molecular Applications Group), MacLook 
(Molecular Applications Group), BLAST and BLAST2 (NCBI), BLASTN and 
BLASTX (Altschul et al, J. Mol. Biol. 215: 403, 1990), FASTA (Pearson and 
Lipman, Proc, Natl. Acad. Sci. USA, 85: 2444, 1988). FASTDB (Brutlag et al. 
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5 Comp. App. Biosci. 6:237-245, 1990), Catalyst (Molecular Simulations Inc.), 
Catalyst/SHAPE (Molecular Simulations Inc.), Cerius2.DBAccess (Molecular 
Simulations Inc.), HypoGen (Molecular Simulations Inc.), Insight II, (Molecular 
Simulations Inc.), Discover (Molecular Simulations Inc.), CHARMm (Molecular 
Simulations Inc.), Felix (Molecular Simulations Inc.), DelPhi, (Molecular Simulations 

1 0 Inc.), QuanteMM, (Molecular Simulations Inc.), Homology (Molecular Simulations 
Inc.), Modeler (Molecular Simulations Inc.), ISIS (Molecular Simulations Inc.), 
Quanta/Protein Design (Molecular Simulations Inc.), WebLab (Molecular 
Simulations Inc.), WebLab Diversity Explorer (Molecular Simulations Inc.), Gene 
Explorer (Molecular Simulations Inc.), SeqFold (Molecular Simulations Inc.), the 

1 5 MDL Available Chemicals Directory database, the MDL Drug Data Report data base, 
the Comprehensive Medicinal Chemistry database, Derwent's World Drug Index 
database, the BioByteMasterFile database, the (jenbank database, and the Genseqn 
database. Many other programs and data bases would be apparent to one of skill in 
the art given the present disclosure. 

20 Motifs which may be detected using the above programs include sequences 

encoding leucine zippers, helix-tum-helix motifs, glycosylation sites, ubiquitination 
sites, alpha helices, and beta sheets, signal sequences encoding signal peptides which 
direct the secretion of the encoded proteins, sequences implicated in transcription 
regulation such as homeoboxes, acidic stretches, enzymatic active sites, substrate 

25 binding sites, and enzymatic cleavage sites. 

The present invention exploits the unique catalytic properties of enzymes. 
Whereas the use of biocatalysts (i.e., purified or crude enzymes, non-living or living 
cells) in chemical transformations nomially requires the identification of a particular 
biocatalyst that reacts with a specific starting compound, the present invention uses 
30 selected biocatalysts and reaction conditions that are specific for fiinctional groups 
that are present in many starting compounds, such as small molecules. Each 
biocatalyst is specific for one functional group, or several related fiinctional groups, 
and can react with many starting compounds containing this functional group. 
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5 The biocatalytic reactions produce a population of derivatives from a single 

starting compound. These derivatives can be subjected to another round of 
biocatalytic reactions to produce a second population of derivative compounds. 
Thousands of variations of the original small molecule or compound can be produced 
with each iteration of biocatalytic derivatization. 

1 0 Enzymes react at specific sites of a starting compound without affecting the 

rest of the molecule, a process which is very difficult to achieve using traditional 
chemical methods. This high degree of biocatalytic specificity provides the means to 
identify a single active compound within the library. The library is characterized by 
the series of biocatalytic reactions used to produce it, a so called *T>iosynthetic 

1 5 history". Screening the library for biological activities and tracing the biosynthetic 
history identifies the specific reaction sequence producing the active compound. The 
reaction sequence is repeated and the structure of the synthesized compound 
determined. This mode of identification, unlike other synthesis and screening 
approaches, does not require immobilization technologies, and compounds can be 

20 synthesized and tested free in solution using virtually any type of screening assay. It is 
important to note, that the high degree of specificity of enzyme reactions on 
fimctional groups allows for the "tracking" of specific enzymatic reactions that make 
up the biocatalytically produced library. 

Many of the procedural steps are performed using robotic automation enabling 
25 the execution of many thousands of biocatalytic reactions and screening assays per 
day as well as ensuring a high level of accuracy and reproducibility. As a result, a 
library of derivative compounds can be produced in a matter of weeks which would 
take years to produce using current chemical methods. 

In a particular embodiment, the invention provides a method for modifying 
30 small molecules, comprising contacting a polypeptide encoded by a polynucleotide 
described herein or enzymatically active fragments thereof with a small molecule to 
produce a modified small molecule. A library of modified small molecules is tested 
to determine if a modified small molecule is present within the library which exhibits 
a desired activity. A specific biocatalytic reaction which produces the modified small 
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5 molecule of desired activity is identified by systematically eliminating each of the 
biocatalytic reactions used to produce a portion of the library, and then testing the 
small molecules produced in the portion of the library for the presence or absence of 
the modified small molecule with the desired activity. The specific biocatalytic 
reactions which produce the modified small molecule of desired activity is optionally 
1 0 repeated. The biocatalytic reactions are conducted with a group of biocatalysts that 
react with distinct structural moieties found within the structure of a small molecule, 
each biocatalyst is specific for one structural moiety or a group of related structural 
moieties; and each biocatalyst reacts witti many different small molecules which 
contain the distinct structural moiety. 

' 5 In another embodiment, the novel alkaline amylases of the invention were 

identified by screening for both activity at high pH and identification of amylases 
with stability in an automatic dish wash (ADW) formulation. Comparisons were 
made to the amylase derived fi*om Bacillus lichenformis. A study of the dependence 
of hydrolysis on pH showed that the majority of the alkaline amylases of the invention 

20 have a pH optima of 7 or less, the exception is clone B with a pH optima of 

approximately 8. The alkaline amylases of the invention retain activity in ADW 
formulations, though clone B is sensitive to high temperatures. Preferably, when used 
in ADW products, the alkaline amylase of the invention will function at a pH 10-1 1 
andat45-60X. 

25 The invention will be fiirther described with reference to the following 

examples; however, it is to be understood that the invention is not limited to such 
examples. 

EXAMPLES 

EXAMPLE 1 



30 IDENTIFICATION AND CHARACTERIZATION OF THERMOSTABLE a- 

AMYLASES 
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5 The present example shows the identification of novel acid amylases. The 

screening program was carried out under neutral and low pH conditions. DNA 
libraries generated from low pH samples were targeted for discovery. This effort 
afforded the discovery of hundreds of clones having the ability to degrade starch. 
DNA sequence and bioinformatic analyses classified many of these genes as 
10 previously unidentified amylases. 

Biochemical studies 

Biochemical analysis of the amylase genomic clones showed that many had 
pH optima of less than pH 6. Lysates of these genomic clones were tested for thermal 
tolerance by incubation at 70**C, 80X, 90**C or lOO^C for 10 minutes and 

15 measurement of residual activity at pH 4.5. Those clones retaining >50% activity 
after heat treatment at 80®C were chosen for fiirther analysis. These clones were 
incubated at 9QfC for 10 minutes at pH 6.0 and 4.5 and tested for residual activity at 
pH 4.5 (Figure 1). A number of clones retained >40% of their activity following this 
treatment. For comparative purposes, residual activity of an evolved amylase, clone c, 

20 was equivalent to the best of the second-generation enzymes; the specific activity of 
clone c was greater. 

Thermal activity of the clones with residual activity after heat treatment at 
90°C at pH 4.5 was measured at room temperature, 70®C and 90**C at pH 4.5. Table 1 
shows that the hydrolysis rates of SEQ ID NO.: 87 (A stearothermophilus amylase) 
25 and SEQ ID NO. 113 (5. licheniformis amylase) decrease at higher temperatures, 
whereas the rate for SEQ ID NO.: 125 continues to increase as the temperature is 
raised to 70**C and only reduces by around 50% at 90**C. 
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5 Candidate evaluation 

Based on residual activity at pH 4.5 after a 90 X heat treatment, specific 
activity and rate of starch hydrolysis at 90 ''C when compared with B.licheniformis 
amylase, SEQ ID NO.: 125 is compared with the evolved amylase clone c in a starch 
liquefaction assay. 

10 





Room 
temperature 


70°C 


90° 


SEQIDNO.:87' 


1.25 


1.43 


0.33 


SEQ ID NO.: 

113^ 


3.3 


1.9 


0.39 


SEQ ID NO.: 125 


1.9 


47 


19 



Table 1. Rates of dye labeled starch hydrolysis (relative fluorescence units/s) 
of three genomic clones at pH 4.5 and 3 different temperatures. ^B. 
1 5 stearothermophilus amylase, ^B. licheniformis amylase 

EXAMPLE 2 

THERMOSTABLE AMYLASES ACTIVE AT ALKALINE pH 

The initial focus of this example was the evaluation of an existing panel of 
20 amylases in an commercial automatic dish wash (ADW) fomiulation. This effort 

identified two candidates: one with activity at high pH (SEQ ID NO.: 11 5) and another 
with stability in the ADW formulation (SEQ ID NO.:207). Studies also included the 
identification of high pH amylases. This effort afforded the discovery of hundreds of 
clones having the ability to degrade starch. DNA sequence and bioinformatics 
25 analyses classified many of these genes as previously unidentified amylases. The 
remaining open reading fi-ames were neopullulanases, amylopullulanases and 
amylomaltases. Extensive biochemical and applications studies showed that 3 
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5 candidates: clone B, SEQ ID NO.:147 and SEQ ID NO.:139) have high specific 
activity at pH 1 0, but unfortunately lack stability in the ADW formulation. In 
summary, a panel of novel amylases each having desirable phenotypes for the ADW 
application has been identified. 

10 Biochemical studies 

Biochemical analysis of the amylase genomic clones showed that many of 
them hydrolyzed starch at pH 10 and SO'^C. To produce sufficient quantities of 
enzyme for further biochemical and applications testing, the amylase open reading 
frames of the 40 most active genomic clones were subcloned into expression vectors. 
1 5 This effort included making 2 constructs for those clones containing a putative signal 
sequence and establishing the growth and induction conditions for each subclone 
(plus and minus the amylase signal peptide). 

Soluble, active protein was successfully purified to homogeneity from 34 
subclones and specific activity (units/mg, where 1 unit = \imol reducing sugars/min) 

20 was measured at pH 8 and pH 10 (40**C and 50*C) using 2% starch in buffer. The 
amylase from Bacillus licheniformis (SEQ ID N0.:1 13) was chosen as the 
benchmark for these studies. Specific activity was determined by removing samples 
at various time points during a 30 minute reaction and analyzing for reducing sugars. 
The initial rate was determined by fitting the progress curves to a linear equation. A 

25 comparison of the top candidates is shown in Table 2. 

A study to detemiine the dependence of hydrolysis rate on pH showed that 
only clone B is an "alkaline amylase" with a pH optimum of approximately 8; all 
others had pH optima of 7 or less. Nevertheless, it is clear that the panel of hits 
included several lead amylases with appreciable activity at pH 10 and 50°C. 



Table 2.Specific activities (1 


J/mg pure enryme) of amylases 


Enzyme 


Specific activity 
pH 8, 40»C 


Specific activity 
pH 10, 50*>C 


Clone B 


682 


20 


SEQ ID NO.:139 


430 


33 
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5 



SEQIDNO.:127 


250 


47 


SEQ ID NO.:137 


230 


3 


SEQ ID NO.:113 
(B. licheniformis) 


228 


27 


SEQ ID NO.:205 


163 


4 


Remainder 


<40 





Stability 

Stability in the presence of the ADW fonnulation was measured for each of 
the 3 top candidates identified via biochemical analysis. The benchmark for these 

10 studies was a commercial enzyme in the formulation matrix. Figure 1 3 illustrates the 
residual activity (measured at pH 8 and 50^C) after a 30 minute mcubation at 50*C in 
the presence of various components of the ADW formulation; pH 8, pH 10.8, ADW 
solution (with bleach) and ADW solution (without bleach). The measured activity 
after the incubation is expressed as a percentage of the original activity. The data 

1 5 show that clone B was very sensitive to high temperature, whereas the other amylases 
were less affected. When the enzymes were incubated at high pH and temperature, 
the commercial enzyme SEQ ID NO.: 139 became less stable; however, SEQ ID NO.: 
127 retained full activity. The apparently anomalous behavior of SEQ ED NO.: 127 
after pH 10 incubation vs pH 8 was observed in repeated trials. 

20 When amylase activity on dye-labeled starch is measured in the ADW matrix 

at 50**C, the commercial amylase exhibits roughly 5% of its activity at pH 8. In the 
same assay, clone B, SEQ ID NO.: 139 and SEQ ID NO.: 127 exhibit <2% of their 
original activity measured at pH 8. 

Wash tests 

25 Wash tests using starch coated slides were carried out to gauge the 

performance of each of the purified enzymes as compared to the commercial amylase. 
The spaghetti starch coated slides were prepared according to protocol. Two pre- 
weighed starch coated slides were placed back to back in a 50 mL conical tube and 25 
mL of ADW solution, +/- enzyme were added per tube. The tubes were incubated for 
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5 20 minutes at 50°C with gentle rotation on a vertical carousel. Following the 
incubation period, the slides were immediately rinsed in water and oven dried 
overnight. All trials were run in duplicate and the commercial enzyme was run as a 
positive control. The results (Figure 6) of these experiments are expressed as net % 
starch removed, e.g. % of starch removed in ADW with enzyme, minus the % of 
1 0 starch removed in ADW alone. 



EXAMPLE 3 

GENE OPTIMIZATION 

The properties of enzymes may be improved by various evolution strategies, 
1 5 including GeneSiteSaturationMutagenesis (GSSM™) and GeneReassembly™ . 
(Diversa Corporation, San Diego, CA). Such techniques will be applied to the 
discovered amylase genes in order to generate pools of variants that can be screened 
for improved performance. 

Parental molecules for evolution will be one or all of the following: SEQ ID 
20 NO.: 113, SEQ ID NO.: 139, SEQ ID N0.:1 15 and SEQ ID NO.: 127 (a truncated 
form of SEQ ID NO.: 127). 

A high throughput screen has been developed to assess enzyme performance 
in the presence of ADW performance. Development of a HTS is of paramount 
importance in any evolution program The HTS is automated and has showed 
25 consistent results for the parental amylases (Figure 7). Parental amylases have 

measurable activity in the ADW formulation, however highly reduced relative to pH 8 
activity. 

Mutants of SEQ ID NO:81 Generated by GSSM or by Random Mutation 
GSSM 

30 Host: Rcoli,XLl.blueMRF* 
Vector: pSE420 
Parent: SEQIDN0:81 
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5 The mutated residues included #2 to 93, #1 18-159, #389 and #433. 



List of elevated active amylase clones tliat have been sequenced 





Name 


Change 


Data in E. coli 


10 


SEO ID NO-81/4V 


i4V, lAl tOvjiU 


2x increase m activity 






Y4A, TAT to GCG 


2x increase in activity 


15 


SEQIDNO:81/30V 


T30V, ACC to GTT 


2.Sx increase in activity 




SEQIDNO:81/31Q 


I31Q,ATCtoTAG 


1 .5x increase in activity 




SEQIDNO:81-4V.30V 


4V+30V 


3x increase in activity 


20 


SEQIDNO:81-2C 


C154A&C155A 


1 .2x increase in activity 




SEQIDNO:81-4C 


C154A,C155A& 
C389A, C433A 


2x increase in activity 


25 


Note: 







• For SEQ ID NO:81/31Q, the TAG codon should code for a stop codon, but it 
codes for Gin in XLl-blue MRF' cells because of supE mutation of the host. 



30 

Random Mutation 

Random mutations were introduced to SEQ ID N0:81 by error-prone PGR as 
follows: 

35 • Use ofunbalanced dATP, dCTP, dGTP or dTTP in reaction to introduce 

mutations 

• Use of different concentration of MnCh in reaction to introduce mutations 

• Use of combination of above two method 

40 Confirmation of new active clones: 

• Grow the up-mutants from primary screening in 1 5ml Falcon tubes and induce 
withlmMIPTG 

• Re-assay using RBB-starch as substrate 

• Sequence the clone to confirm the mutation. 

45 

List of elevated active amylase clones that have been sequenced 



wo 02/068589 



PCTAJS02/05068 



96 

Data in Shake flask Data in fermentor 

1 .3-2x increase in activity same as SEQ ID N0:8 1 - 

2x increase in activity same as SEQ ID N0:81- 



EXAMPLE 4 

CHARACTERIZATION OF a-AMYLASES HAVING ACTIVITY AT 

ALKALINE dH 

Amylases of the invention having activity at alkaline pH were characterized 
further. Kinetics on 2% starch at pH 8 andlO (40'»C and SCC) have been performed. 



Table 4: 



Clones, specific activities 


dH 8, 40**C 


dH 10. 


SEQ ID NO.: 1 13 (A lichenoformis) 


228 units/mg 


27 


units/mg 




Clone B 


682 units/mg 


31 


units/mg 






SEQ ID NO.: 139 


430 units/mg 


33 


units/mg 




SEQ ID NO.: 127 


540 units/mg 


50 


units/mg 




control 0GL5 (£. coli) 


1 .8 units/mg 


0 units/mg 



1 unit of activity is deSned as release of 1 ^ol reducing sugars per 
minute. 



35 



EXAMPLE S 

AMYLASE ACTIVITY ASSAYt BCA REDUCING ENDS ASSAY 

40 Amylase activity of clones of interest was determined using the following 

methodology. 



1 . Prqiare 2 substrate solutions, as follows: 



5 Name 

SEQlDNO:81-4C-UMl 
4C 

SEQIDN0:81-4C-UM2 
4C 
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5 a) 2% soluble starch (potato) pH 8 solution by dissolving 2 gm potato 

starch in 100 ml 100 mM sodium phosphate pH 8). 

b) 2% soluble starch (potato) pH 1 0 solution by dissolving 2 gm potato 
starch in 100 ml 100 mM sodium carbonate. 

Heat both solutions in a boiling water bath, while mixing, for 30-40 minutes 
1 0 until starch dissolves. 

2. Prepare Solution A from 64 mg/ml sodium carbonate monohydrate, 24 
mg/ml sodium bicarbonate and 1.95 mg/ml BCA (4,4*-dicarboxy.2,2'. biquinoline 
disodium salt (Sigma Chemical cat # D-8284). Added above to dH20. 

3. Prepare solution B by combining 1 .24 mg/ml cupric sulfate 
15 pentahydiate and 1.26 mg/ml L-serine. Add mixture to to dH20. 

4. Prepare a working reagent of a 1 : 1 ration of solutions A and B. 

5. Prepare a Maltose standard solution of 1 0 mM Maltose in dH20, 
where the 10 mM maltose is combined in 2% soluble starch at desired pH to a final 
concentration of 0, 100, 200, 300, 400, 600 ^M. The standard curve will be generated 

20 for each set of time-points. Since the curve is determined by adding 10 ul of the 
standards to the working reagent it works out to 0, 1, 2, 3, 4, 6 nmole maltose. 

6. Aliquot 1 ml of substrate solution into microcentrifuge tubes, 
equilibrate to desired temperature (5 min) in heat block or heated water bath. Add 50 
ul of enzyme solution to the inside of the tube lid. 

25 7. While solution is equilibrating mix 5 ml of both solution A & B. 

Aliquot 100 ul to 96 well PCR plate. Set plate on ice. 

8. After 5 minute temperature equilibration, close lid on tubes, invert and 
vortex 3 times. Inunediately aliquot 10 ul into plate as t=K) (zero time point). Leave 
enzyme mixture in heat block and aliquot 10 ul at each desired time point (e.g. 0, 5, 
30 10,15, 20, 30 minutes). 
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5 9. Ensure that 1 2 wells are left empty (only working reagent aliquotted) 

for the addition of 10 ul of standards, for the standard curve. 

10. When all time points are collected and standards are added, cover plate 
and heated to 80^ C for 35 min. Cool plate on ice for 10 min. Add 100 ul H20 to all 
wells. Mix and aliquot 100 ul into flat bottomed 96-well plate and read absorbance at 

10 560 nm. 

1 1 . Zero each samplers time points against its own t=0 (subtract the 
average t=0 A560 value from other average A560 values). Convert the 
A560(experimental) to umole (Divide A560(experimental) by the slope of the 
standard curve (A560/umole). Generate a slope of the time points and the umole (in 

15 umole/min), multiply by 100 (as the umole value only accounts for the 10 ul used in 
the assay, not the amount made in the 1ml rxn). To get the specific activity divide the 
slope (in umole/min) by the mg of protein. All points should be done at a minimum 
in duplicate with three being best. An example standard curve is set forth in Figure 
11. 



20 



Table 5: Sample data: 

(A560exp/std 
slope) 

Clone Dilution Minutes AS60-1 AS60-2 Ave A 560 Zeroed A 560 umole 
ENZ 50 0 0.1711 0.1736 0.17235 0 0.0000 

5 0.2104 0.2165 0.21345 0.0411 0.0005 
10 0,2492 0.2481 0.24865 0.0763 0.0009 
15 0.2984 0.2882 0.2933 0.12095 0.0014 
20 0.3355 0.3409 0.3382 0.16585 0.0020 
30 0.3942 0.3805 0.38735 0.215 0.0026 
40 0.4501 0.4412 0.44565 0.2733 0.0033 



Activity =0.008646 umole/min 

Divide protein concentration (mg/ml) by any dilution to get mg used in assay. 
25 Divide the above slope by mg used in assay to get specific activity 
Specific Activity =24.93 umole/min/mg 
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(See for example, Dominic W.S. Wong, Sarah B, Batt, and George H. 
Robertson (2000). Microassay for rapid screening of alpha-amylase activity. J. 
AgricFoood Chem. 48.4540-4543 and Jeffrey D. Fox and John F. Robyt. (1991). 
Minituratization of three carbohydrate analyses using a micro sample plate reader. 
Anal. Biochem. 195, 93-96, herein incoiporatedbyreiference). 



EXAMPLE 6 
SCREENING FOR g-AMYLASE ACTIVITY 

15 Amylase activity of clones can be assessed by a number of methods known in 

the art. The following is the general methodology that was used in the present 
invention. The number of plaques screened, per plate, should be approximately 
10,000 pfu's. For each DNA library: at least 50,000 plaques per isolated library and 
200,000 plaques per non-isolated library should be screened depending upon the pfii 

20 titer for the X Zap Express amplified lysate. 

Titer determination of Lambda Librarv 

1 ) jiL of Lambda Zap Express amplified library stock added to 600^iL E, coli MRF* 
cells (OD(soo=1.0). To dilute MRF* stock, lOmM MgS04 is used. 

2) Incubate at 37*^0 for 15 minutes. 

3) Transfer suspension to 5-6mL of NZY top agar at 50 and gently mix. 

4) Inmiediately pour agar solution onto large (1 50mm) NZY media plate. 

5) Allow top agar to solidify completely (approximately 30 minutes), then invert 
plate. 

6) Incubate the plate at 39 for 8-12 hours. 

7) Number of plaques is approximated. Phage titer determined to give 10,000 
pfu/plate. Dilute an aliquot of Library phage with SM buffer if needed. 



Substrate screening 
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5 

1) Lambda Zap Express (50,000 pfii) from amplified library added to eOO^L of £. 
coli MRF* cells (OD600=1.0). For non-environment libraries, prepare 4 tubes 
(50,000 pfii per tube). 

2) Incubate at 37 ®C for 15 minutes. 

10 3) While phage/cell suspension are incubating, l.QmLofred starch substrate (1.2% 
w/v) is added to 6.0mL NZY top agar at 50 and mixed thoroughly. Keep 
solution at 50^C until needed. 
4) Transfer 1/5 (10,000 pfii) of the cell suspension to substrate/top agar solution and 
gently mixed. 

1 5 5) Solution is immediately poured onto large (1 50mm) NZY media plate. 

6) Allow top agar to solidify completely (approximately 30 minutes), then invert 
plate. 

7) Repeat procedures 4-6 4 times for the rest of the cell suspension (1/5 of the 
suspension each time). 

20 8) hicubate plates at 39 for 8-12 hours. 

9) Plate observed for clearing zones (halos) around plaques. 

10) Plaques with halos are cored out of agar and transferred to a sterile micro tube. A 
large bore 200)iL pipette tip works well to remove (core) the agar plug containing 
the desired plaque. 

25 1 1) Phages are re-suspended in 500fiL SM bufifer. 20^L Chloroform is added to 
inhibit any fiirther cell growth. 
12) Pure phage suspension is incubated at room temperature for 4 hours or overnight 
before next step. 

30 Isolation of pure clones 

1) lO^L of re-suspended phage suspension is added to 500(iL of £. coli MRF' cells 
(OD600=1.0). 

2) tacubate at 37 for 15 minutes. 
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5 3) While phage/cell suspension is incubating, 1 mL of red starch substrate ( 1 .2% w/v) 
is added to 6,0mL NZY top agar at 50 ^'C and mixed thoroughly. Keep solution at 
50 X until needed. 

4) Cell suspension is transferred to substrate/top agar solution and gently mixed. 

5) Solution is immediately poured onto large (150mm) NZY media plate. 

1 0 6) Allow top agar to solidify completely (approximately 30 minutes), then invert 
plate. 

7) Plate incubated at 39 for 8-12 hours. 

8) Plate observed for a clearing zone (halo) around a single plaque (pure clone). If a 
single plaque cannot be isolated, adjust titer and re-plate phage suspension. 

1 5 9) Single plaque with halo is cored out of agar and transferred to a sterile micro tube. 
A large bore 200^L pipette tip works well to remove (core) the agar plug 
containing the desired plaque. To amplify the titer, core 5 single active plaques 
into a micro tube. 

10) Phages are re-suspended in SOO^iL SM buffer. 20^L Chloroform is added to 
20 inhibit any further cell growth. 

1 1) Pure phage suspension is incubated at room temperature for 4 hours or overnight 
before next step. The pure phage suspension is stored at -80 ''C by adding DMSO 
into the phage suspension (7% v/v). 

25 Excision of pure clone 

1) 100|iL of pure phage suspension is added to 200^L £. coli MRF' cells 
(OD600=1.0). To this, l.OjiLofExAssist helper phage (>1 x 106 
pfu/mL;Stratagene) is added. Use 2059 Falcon tube for excision. 
30 2) Suspension is incubated at 37 X for 15 minutes; 

3) 3.0 mL of 2 X YT media is added to cell suspension. 

4) Incubate at 30 **C for at least 6 hours or overnight while shaking. 

5) Tube transferred to 70 **C for 20 minutes. The phagemid suspension can be stored 
at 4 °C for 1 to 2 months. 
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5 6) 1 00 ^L of phagemid suspension transferred to a micro tube containing 200jiL of 
E. coli Exp 505 cells (OD600=1 .0). 

7) Suspension incubated at 37 for 15 minutes. 

8) 300(iL of SOB is added to the suspension. 

9) Suspension is incubated at 37 for 30 to 45 minutes. 

10 1 0) 1 OO^L of suspension is transferred to a small (90mm) LB media plate containing 
Kanamycin (LB media with Kanamycin 50fig/mL) for Zap Express DNA libraries 
or Ampicillin (LB media with Kanamycin lOO^ig/mL) for Zap II DNA libraries. 
1 l)The rest of suspension is transferred to another small LB media plate. 
12) Use sterile glass beads to evenly distribute suspension on the plate. 
15 13) Plates are incubated at 30 ^'C for 12 to 24 hours. 

14) Plate observed for colonies. 

15) Inoculate single colony into LB liquid media containing suitable antibiotic and 
incubate at 30 '^^C for 12 to 24 hours. 

16) Glycerol stock can be prepared by adding 80% glycerol into liquid culture (15% 
20 v/v) and stored at -80 

Activity verification 

25 1) 50jiLofliquid culture is transferred to a micro tube. Add 500jiLof 8%pH7 
Amylopectin Azure into the same tube. Prepare 2 tubes for each clone. 

2) Activity is tested at 50 for 3 hours and overnight. Use pH 7 buffer as control. 

3) Cool the test specimen at ice-water bath for 5 minutes. 

4) Add 750^L of Ethaqnol and mixed thoroughly. 

30 5) Centrifuge at 13000 rpm (16000 g's) for 5 minutes. 
6) Measure OD of the supernatant at 595nm. 

RFLP analvsis 



35 



1) 1 .OmL of liquid culture is transferred to a sterile micro tube. 
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5 2) Centrifuge at 1 3200 rpm (16000 g's) for 1 minute. 

3) Discard the supernatant. Add another 1 .0 mL of liquid culture into the same 
sterile micro tube. 

4) Centrifuge at 13200 rpm (16000 g's) for 1 minute. 

5) Discard the supernatant 

1 0 6) Follow QIAprep spin mini kit protocol for plasmid isolation. 

7) Check DNA concentration using BioPhotomcter. 

8) Use Sac I and Kpn I for first double digestion. Incubate at 37 *C for 1 hour. 

9) Use Pst I and Xho I for second double digestion. Incubate at 37 ®C for 1 hour. 

1 0) Add Loading dye into the digested sample. 

15 11) Run the digested sample on a 1 .0% agarose gel for 1-1 .5 hours at 120 volts. 

12) View gel with gel imager. All clones with a different digest pattern will be sent 
for sequence analysis. 

EXAMPLE 7 

20 ASSAY FOR AMYLASES 

Preparation Of Host Cultures 

1 . Start an overnight culture of XL 1 -Blue MRF' host cells. Use a single colony 
from a streak plate to inoculate 10 mL LB supplemented with 20 ug/mL 
tetracycline. Grow overnight culture shaking at 37**C for at least 16 hours. 
25 2. Using aseptic technique^ inoculate a fresh 1 00 mL of LBjct day culture with 
XLl-Blue MRF' host bom the overnight LBjet culture. 

3. Grow in a 37'*C shaker until the CD reaches 0.75 - 1 .0. 

4. Pellet host cells at 1000 x g for 10 minutes and gently resuspend in 10 mM 
MgS04 at 0D5. 

30 5. Dilute a small amount of host cells to ODl for use in titering and pintooling. 
6. Host preparations can be used for up to 1 week when stored on ice or at 4**C. 



COMMENTS 
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5 -To shorten growth time for the day culture, use /2X the usual Tet 

concentration in LB (V4X = 10 ug/mL), or omit the antibiotic altogether. 

-Do not use NZY when selecting with Tetracycline. The high Mg""" 
concentration in NZY medium renders Tet inactive. 



10 Titering Lambda Libraries 

7. Place three sterile microfuge tubes in a rack. 

8. Aliquot 995 uL prepared host cells in one tube and 45 uL prepared OD 1 host 
cells into each of the two remaining tubes. 

1 5 9. Add 5 uL of lambda library to the tube containing 995 uL host cells and mix 

by vortexing. This results in a dilution factor of 200. 
10. Prepare 1/2,000 and 1/20,000 dilutions by consecutively adding 5 uL of 

previous dilution to the remaining two tubes containing 45 uL prepared host 

cells. Mix by vortexing after each dilution was made. 
20 11. Allow phage to adsorb to host by incubating at 37**C for 1 5 minutes. 

12. Meanwhile, pipet 100 uL of prepared ODl host cells to each of three Falcon 
2059 tubes. 

13. Add 5 uL of each dilution to a separate 2059 tube containing host cells. 

14. Plate each by adding 3 mL top agar to each tube and quickly pour over 90 mm 
25 NZY plates. Ensure a smooth, even distribution before the top agar hardens. 

1 5. Invert plates and incubate at 37^C overnight. 

16. Count plaques and calculate titer of the library stock (in plaque forming units 
(pfii) per uL). 

30 Lambda Microtiter Screening For Amylases 
Preparation 
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5 1 . Prepare a sufficient amount of XLl-Blue MRF' host culture, as described 
above, for the amount of screening planned. A culture of 100 mL is usually 
sufficient for screening 2-3 libraries. 

2. Autoclave several bottles compatible with the QFill2 dispenser. These are 
the wide-mouth Coming bottles, 250 mL containing a sealing ring around the 
10 lip. 

Make sure there are sufficient amounts of plates, top agar, BODIPY 
starch, red starch solution, etc. available for the screen. 

Schedule the Day 2 robot run with a representative from Automation. 

Day 1 

Label the 1536-well plates (black) with library screen and plate number. 
Tough-Tags™ tube stickers, cut in half width-wise, are ideal for labeling 1536 
well plates. 

Calculate volumes of library, host cells and NZY medium necessary for 
the screen. This is easily done with an Excel spreadsheet. 

Combine the calculated volumes of lambda library and OD5 host cells in 
a sterile 250 mL wide-mouth Coming bottle (containing a sealing ring). 

Allow adsorption to occur at 37T for 15 minutes. 

Add the calculated volume of NZY medium and mix well. This is referred 
to as the cell-phage-medium suspension. 

25 6. Perform a concomitant titer by combining 50 uL of the cell-phage- 

medium suspension with 250 uL of ODl host cells in a Falcon 2059 tube, then 
plating with 9 mL of top agar onto a 1 50 mm NZY plate. Incubate 
concomitant titer plate at 37X overnight. 



15 1. 
2. 

20 3. 
4. 
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Load the dispenser with the remainder of the suspension and array each 
labeled 1536-well plate at 4 uL per well. If the dispenser leaves air bubbles in 
some wells, they can be removed by centrifiiging the plates at 200 x g for 1 
minute. 

Add 0.5 uL of positive control phage to well position AD46 of at least 
two of the assay plates. Use a strong amylase-positive lambda clone for this 
purpose. The lambda versions of SEQ ID NO.: 1 13 or SEQ ID NO.: 199 are 
good choices for positive controls. 

9. Incubate assay plates at 3TC overnight in a humidified (^95%) incubator. 



8. 

10 



15 Dav2 

1 . Count the pfu on the concomitant titer plate and determine the average seed 
density per well (in pfu per well). 

2. Pintool at least 2 plates of each library screen (preferably the 2 containing 
positive controls) as follows: 

20 a) Prepare 2 host lawn plates to act as a surface on which to pintool: 

combine 250 uL of ODl host cells with 2 mL 2% red starch and plate 
with 9 mL top agar onto 1 50 mm NZY plates. Hold each plate as level as 
possible as the top agar solidifies in order to produce an even hue of red 
across the plate. 

25 b) Using a twice flame-sterilized 1 536 position pintool, replicate at least 2 

of the screening plates onto the host lawn plates. 

c) Place the pintooled recipient plates in a laminar flow hood with the lids 
off for about 15-30 minutes (to vent off excess moisture). 

d) Replace the lids and incubate inverted at 37**C overnight, 
30 3. Prepare the 2X BODIPY starch substrate buffer as follows: 

a) Calculate the total volume of 2X substrate buffer solution needed for all 
screening plates at 4 uL per well (including any extra deadspace volume 
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required by the dispenser) and measure this amount of 100 mM CAPS 
pH 10.4 into a vessel appropriate for the dispenser used. 

b) Retrieve enough 0.5 mg tubes of BODIPY starch to produce the required 
volume of 2X substrate buffer [calculated in step a) above] at a final 
concentration of 20-30 ug/mL. 

c) Dissolve each 0.5 mg tube in 50 uL DMSO at room temperature, 
protected from light, with fi-equent vortexing. This takes more than 15 
minutes; some production lots of BODIPY starch dissolve better than 
others. 

d) Add 50 uL lOOmM CAPS buflFer pH 10.4 to each tube and mix by 
vortexing. 

e) Pool the contents of all tubes and remove any undissolved aggregates by 
ccntrifiiging for 1 minute at maximum speed in a microfuge. 

f) Add the supernatant to the rest of the 1 00 mM CAPS buffer measured in 
step a) above. 

g) Protect the 2X substrate buffer fix>m light by wrapping in foil. 

4. Take plates and substrate buffer to the automation room and program the robot 
with the following parameters: 

a) dispense 4 uL substrate buffer per well 

b) 1 ^ read at 1 hour post-substrate, 2™* read at 9 hours, and third read at 1 7 
hours; with 37**C incubation between reads 

c) excitation filter: 485 nm; emission filter: 535 nm 

d) set the Spectrafluor gain at 70, or the optimal gain for the batch of 2X 
substrate buffer prepared. 

e) ensure that the incubator used will protect assay plates from light. 

Day 3 

1 . Check pintooled plates for clearings in the bacterial lawn at all 

positions corresponding to wells on the associated assay plate. Also check for 
clearings in the red starch in any of the pin positions. If plates containing 
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5 positive controls were used for pintooling, you should be able to see a large 

clearing zone in the red background. Be wary of contaminants that also form 
clearing zones in red starch (see comment "Contaminants That Form Clearing 
Zones in Red Starch" at end of Example 7). 

2. Identify putative hits from the data file produced by the robot 

1 0 computer. The KANAL program produced by Engineering simplifies data 

analysis. As a rule of thumb, a putative hit is characterized as a well having 
signal intensity rising at least l.S fold over background. 

3. For each putative^ remove 2 uL from the well and add to a tube 
containing SOO uL SM buffer and SO uL CHC13. Vortex to mix and store at 

15 4^C. This solution will be referred to hereafter as the 4e-3 stock. The original 

screening plates should be stored at 4""^ protected from light, at least until 
breakouts are complete. 



This is the reconunended method of breaking out putative hits. It is a liquid 
phase assay that relies on confirmation of activity on BODIPY starch. Alternatively, 
putative hits can be plated directly onto solid phase plates containing red starch such 
that 2,000-3,000 pfii per hit are examined for clearing zones. However, inability to 
observe clearing zones on red starch is not necessarily ah indication that a putative hit 
was a false positive. It would then need to be assayed using the format in which it was 
originally identified (i.e., liquid phase using BODIPY starch as substrate). In addition, 
very weak positives are more easily identified using the method detailed below. 

Day 1 

1 . In a sterile 50 mL conical tube, combine 0.5 mL 0D5 host cells with 45.5 mL 
NZY. This will be referred to as the host-medium suspension. 
30 2. For each putative hit to be analyzed, aliquot 1 mL of host-medium suspension 
into each of 3 three sterile microfiige tubes. 
3. Set the 12-channeI pipetman in multidispense mode with an aliquot size of 20 
uL and an aliquot number of 2x. Mount the pipetman with a clean set of sterile 
tips. 



20 



25 
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5 4. Pour about 1 mL of host-medium suspension into a new sterile solution basin 
and load the multichannel pipetman. 

5. Dispense 20 uL per well into the last row (row P) of a black 384-well plate (12 
channels x 2 = 24 wells). This row will be used later for the controls. 

6. Expel the remaining liquid in the tips by touching the tips against the surface 
10 of the basin and pressing the RESET button on the pipetman. Lay the 

pipetman down in a way to prevent contamination of the tips. There is no need 
to change the tips at this point. 

7. Pour the remainder of the fluid in the basin into a waste container (like a 
beaker) taking care to avoid splash-back contamination. 

1 5 8. For the first putative to be analyzed, take 1 1 1 uL of the 4e-3 stock (see Day 2 
in Lambda Microtiter Screening for Amylases) and add it to the first in a set of 
three tubes containing 1 mL host-medium suspension (step 2). Vortex to mix. 
This is Dilution A. 

9. Take 1 1 1 uL of Dilution A and add to the next tube in the set. Vortex to mix. 
20 Thxs 'i^ Dilution B. 

10. Take 1 1 1 uL of Dilution B and add to the last tube in the set. Vortex to mix. 
This is Dilution C. You should now have three dilutions of phage, where 
concentrations of each differ by a factor of 1 0. 

1 1 . Pour the contents of Dilution C (the most dilute of the 3 samples) into the 
25 solution basin and load the multichannel pipetman: 

12. Dispense 20 uL per well into the first row of the 384-well plate (12 channels x 
2 = 24 wells). 

13. Expel the remaining liquid in the tips by touching the tips against the surface 
of the basin and pressing the RESET button on the pipetman. Lay the 

30 pipetman down in a way to prevent contamination of the tips. There is no need 

to change the tips at this point. 

14. Empty the basin as described above. 

15. Pour the contents of Dilution B into the same basin and load the multichannel 
pipetman. 

35 16. Dispense 20 uL per well into the second row of the 384-well plate. 
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5 17. Perform stqps 13-16 similarly to dispense Dilution A into the third row of the 
plate. 

18. After all three dilutions have been arrayed into the first 3 rows of the plate, 
discard all tips and the solution basin into the biohazardous waste container. 

19. Mount the pipetman with a clean set of sterile tips and open a new sterile 
10 solution basin. 

20. Repeat steps 8-19 for each remaining putative hit, using remaining rows on the 
plate up to row O. Five putative hits can be analyzed on one 384-well plate, 
with the last row (row P) saved for the controls. 

2 1 . Add O.S uL of each control to a separate well. Use at least 2-3 sisparate 
1 5 controls, preferably covering a range of activity. 

22. Incubate assay plates at 3TC ovemi^t in a humidified (^95%) incubator. 



Day 2 



20 1 . Pintool all breakout plates onto a host lawn with red starch using the same 
^method described for Day 2 in Lambda Microtiter Screening for Amylases ^ 
except that a 384 position pintool is used. 
2. Prepare the 2X BODIPY starch substrate buffer as follows: 

a) Calculate the total volume of 2X substrate buffer solution needed for all 
25 breakout plates at 20 uL per well (including any extra deadspace volume 

required by the dispenser) and measure this amount of 100 mM CAPS 
pH 10,4 into a vessel appropriate for the dispenser used. 

b) Retrieve enough 0.5 mg tubes of BODIPY starch to produce the required 
volume of 2X substrate buffer [calculated in step a) above] at a final 

30 concentration of 20-30 ug/mL. 

c) Dissolve each 0.5 mg tube in 50 uL DMSO at room temperature, 
protected firom light, with frequent vortexing. This takes more than 15 
minutes; some production lots of BODIPY starch dissolve better than 
others. 
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5 d) Add 50 uL lOOmM CAPS buffer pH 10.4 to each tube and mix by 

vortexing. 

e) Pool the contents of all tubes and remove any undissolved aggregates by 

centrifuging for 1 minute at maximum speed in a microfiige. 
0 Add the supernatant to the rest of the 100 mM CAPS buffer measured in 
10 step a) above. 

g) Protect the 2X substrate buffer from light by wrapping in foil. 

3. Dispense 20 uL per well into all breakout plates. 

4. Wrap all plates in aluminum foil and incubate at room temperature for 2-6 
hours. 

IS S. Read each plate in the Spectrafluor with the following settings: 

a) fluorescence read (excitation filter: 485 nm; emission filter: 535 nm) 

b) plate definition: 384 well black 

c) read from the top 

d) optimal gain 

20 e) number of flashes: 3 

6. On the resulting Excel spreadsheet, chart each putative's 3 rows in a separate 
graph and check for activity. Ensure that the positives controls produced 
signals over background. 

7. For each putative that appears to have a real signal among the wells, harvest a 
25 sample from a positive well as follows: 

a) Select a positive well from a row representing the highest initial dilution. 

b) Transfer 2 uL from that well into a tube containing 500 uL SM and 50 uL 
CHCI3. This is referred to as the breakout stock. 

c) Store at 4°C. 

30 8. Using methods previously described, plate about 1 0 uL of each breakout stock 
onto 150 mm NZY plates using red starch. The objective is to obtain several 
(at least 20) well-separated plaques from which to core isolates. 



Dav3 
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5 1 . Check pintooled plates for an acceptable incidence of clearings in the bacterial 
lawn conresponding to wells on the associated assay plate. Also check for 
clearings in the red starch in the positive controls and in any tested putatives. 
Be wary of contaminants that also form clearing zones in red starch (see 
below). 

10 2. From the solid phase plates containing dilutions of breakout stocks, core 
several isolated plaques* each into SOO uL SM with SO uL CHCI3. This is 
referred to as the isolate stock. 
3. The isolate stocks can then be individually tested on BODIPY starch using 
methods described above. This step can be skipped if the plaque that was 

1 5 cored in step 2 produced a clearing zone in the red starch background. The 

isolate stocks were then be individually tested on BODIPY starch using 
methods described above. However, this step may be skipped if the plaque that 
was cored in step 2 produced a clearing zone in the red starch background. 

20 Excisions 

Day 1 

1 . In a Falcon 2059 tube, mix 200 uL ODl XLl-Blue MRF* host, 100 uL lambda 
isolate stock and 1 uL ExAssist phage stock. 

25 2. Incubate in 37*0 shaker for 15 minutes. 

3. Add 3 mL NZY medium. 

4. Incubate in 30*C shaker overnight. 

Day 2 

30 1. Heat to excision tube to 70T for 20 minutes. 

2. Centrifuge 1 000 x g for 1 0 minutes. 

3. In a Falcon 2059 tube, combine 50 uL supernatant with 200 uL EXP505 ODl 
host. 

4. Incubate in 37**C shaker for 15 minutes. 
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5 5. Add 300 uL SOB medium. 

6. Incubate in 37C shaker for 30-45 minutes. 

7. Plate 50 uL on large LBicanso plate using sterile glass beads. If the plates are 
"dry", extra SOB medium can be added to help disburse the cells. 

8. Incubate plate at 30**C for at least 24 hours. 

10 9. Culture an isolate for sequencing and/or RFLP. 

Growth at 30^C reduces plasmid copy number and is used to mitigate the apparent 
toxicity of some amylase clones. 

15 Contaminants That Form Clearing Zones in Red Starch 

When using red starch on solid medium to assay phage for amylase activity, it 
is common to see contaminating colony forming units (cfii) that form clearing zones 
in the red starch. For pintooled plates, it is important to distinguish amylase-positive 
phage clones from these contaminants whenever they align with a particular well 

20 position. The source of the contaminating microbes is presumably the 2% red starch 
stock solution, which cannot be sterilized by autoclaving or by filtering after 
preparation. It is thought that they are opportunistic organisms that survive by 
metabolizing the red starch. In order to reduce these contaminants, use sterile 
technique when making 2% red starch solutions and store the stocks either at 4°C or 

25 on ice. 



EXAMPLE 8 
BIOINFORMATIC ANALYSIS 
30 An Initial bioinformatic analysis was made with the known 

hyperthermophillic a-amylase sequences. Figure 14a shows an alignment of the 
sequences some of which have been'deposited at the NCBI database. This analysis 
revealed the potential for designing degenerate primers to PGR the entire gene minus 
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5 its signal sequence (see Figure 14a). yielding potentially novel full-length alpha 
amylases from a library. 

The following libraries were screened by PGR from genomic DNA: 
Table 6: 



Library # 


Name 


PGR positive 


Subcloned 


5 


A.lithotropicus 


No 




13 


Pyrodictium occultum 


No 




17 


Pyrodictium TAGIJ 


No 


Yes 


113 


Deep sea enrichment 


Yes 


Yes 


170 


Deep sea enrichment 


Yes 


Yes 


198 


Archaeglobus 


No 




206 


Acidianus sp 


No 




453 


Mixed Iceland enrich 


No 




455 


Mixed iceland enrich 


Yes 


Yes 



Figure 14b shows an alignment of the identified sequences and the 
table below lists their relative percent identities. 



Table 7: Nucleotide sequence % identity 





SEQ ID 
NO.: 81 


pyro 


Pyro 
2 


thermo 


themi2 


SEQID 
NO.: 75 


SEQID 
NO.: 77 


SEQH) 
NO.: 83 


SEQ ID 
NO.: 85 


SEQID 

NO.; 79 




SEQID 
NO.: 81 


100 


91.7 


75.1 


82.1 


80.1 


82.5 


82.6 


82.1 


82.6 


83 




pyro 




100 


74,8 


82.5 


80.5 


82 


82.2 


82.9 


82.8 


84 




Pyro2 






100 


71.5 


71.1 


74 


74.2 


77 


77.1 


73 




thenn 








100 


81.7 


83.5 


83.8 


82.8 


83.2 


83.8 




therm2 










100 


88.9 


88.8 


84.1 


84.7 


84 




SEQ ID 
NO,: 75 












100 


98.3 


84.6 


85.2 


85.5 




SEQ ID 

NO.: 77 














100 


84.8 


84.9 


85.4 
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SEQID 
NO.: 83 
















100 


96 


83.3 


SEQID 
NO.: 85 


















100 


83 


SEQ ID 
NO.: 79 




















100 


Clone A 























5 — 

The amino acid identity ranges from about 85-98% identity. Accordingly, 
these sequences are useful in shuffling of genes as described herein. 



Figure 14c shows the nucleic acid alignment of the corresponding polypeptide 
10 sequences above. Expression of these amylases in the expression vector pSE420 and 
the host cell line XL 1 -Blue showed 1703 and 1706 to have amylase activity. 



EXAMPLE 9 

15 Characterization of Library 63 GP>1 alpha amylase pH optimum and 

specific activity determination 

In initial experiments, the SEQ ID NO: 81 from Thermococcus showed that it 
was effective in both starch liquefaction for com wet milling and desizing for textiles. 

20 This enzyme has a pH optimum of 4.5 to 5.0. At this lower pH, it is possible to use 
little or no calcium which lowers overall operating costs and less byproduct 
formation. In addition, at this low pH, there is decreased chemical usage and ion 
exchange load. The industry standard 5. licheniformis amylase is suboptimal in both 
thermostability and pH optimum. The 63GP-1 amylase has a higher application 

25 specific activity compared to 5. licheniformis amylase and therefore much less 

enzyme is required to hydrolyze a ton of starch (as much as 20-fold less enzyme can 
be used). 
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5 The pH optimum for the hydrolysis of starch was determined by reacting 50 

uL of the GP-1, 0.35 U/ml, with a 100ml of 1% soluble starch solution (0.0175U/g of 
starch) for 30 minutes at 95 degrees C. The reducing ends generated in the liquefied 
starch solution were measured by the neocupronine assay, described herein. The 
percent hydrolysis of cornstarch was determined by measuring the number of sugar 

1 0 reducing ends produced with the neocupronine assay. Seventy grams of buffer 

solution (pH4-7) was weighed and lOOppm of calcium was added. Thirty grams of 
cornstarch was mixed into the buffer solution to forai a starch slurry. The enzyme 
was added and the vessels sealed and incubated at 95 degrees C for 30 minutes with 
an initial heating rate of six degrees C per minute. A 1 ml sample was extracted from 

15 the reaction beakers and analyzed by the neocupronine assay. The optimum for GP-1 
was between pH 4.5 and 5 , while the conmiercial A licheniformis amylase performed 
optimally at about pH 6.0. 



20 EXAMPLE 10 

Amylase Ligation Reassembly 

Nine fragments (each about 150 bp) were ampHfied from each of the parent 
clones SEQ ID NO.: 81, SEQ ID NO.: 77, SEQ ID NO.: 79, covering the whole open 
reading frame. The primers are provided in Table 8. 

25 



TABLE 8 





SEQ ID NO: 




GAACACTAGTAGGAGGTAACTTATGGCAAAGTATTCCGAGCTCGAAG 


258 


Spel 


GAACGGTCTCATTCCGCCAGCCAGCAAGGGGATGAGCGG 


259 


Bsal 


GAACCGTCTCAAAACACGGCCCATGCCTACGGC 


260 


BsmBI 


GAACGTCTCACCTCGACTTCCACCCCAACGAGGTCAAG 


261 


BsmAI 


GAACGTCTCAGGCGCnTGACTACGTGAAGGGC 


262 


BsmAI 


GAACGGTCTCAACAAGATGGATGAGGCCTTTG 


263 


Bsal 


GAACCGTCTCACGATATAATCTGGAACAAGTACCTTGC 


264 


BsmBI 


GAACCGTCTCAGAAGCACGAGCATAGTrTACTACG 


265 


BsmBI 


GAACCGTCTCAAAGGTGGGTTTATGTGCCG 


266 


BsmBI 


GAACGTCTCAGGAATCCAAATGGCGGATATTCCCGC 


267 


BsmAI 
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GAACGGTCTCAGTTTATCATATTGATGAGCTCC 


268 


Bsal 


GAACCGTCTCAGAGGTAGTTGGCAGTATATTTG 


269 


BsmBI 


GAACGTCTCACGCCAGGCATCAACGCCGATG 


270 


BsmAI 


GAACGTCTCATTGTAGTAGAGCGGGAAGTC 


271 


BsmAI 


GAACGGTCTCAATCGGTGTCGTGGTTTGCTAC 


272 


Bsal 


GAACCGTCTCACTTCCACCTGCGAGGTGGTC 


273 


BsmBI 


GAACCGTCTCACCTTCCAACCTTGCTCGAGC 


274 


BsmBI 


TCGAGACTGACTCTCACCCAACACCGCAATAGC 


275 


GAACACTAGTAGGAGGTAACTTATGGCCAAGTACCTGGAGCTCGAAG 
AGG 

UA AL/VjU I I 1 1 V^^a^LA^OoCUAuC AACjGvjC 


276 


Spel 


277 


Bsal 


1^ A A #^/^/^T'/"*r/^ A A A A A ^/^/l/^i^O A /^/~«/>i'>Tr A r^f>^ 

U AAL/l^vj Iv^ 1 \-.AA AAL- ACCCjCCCACCjCC 1 ACGG 


278 


BsmBI 


u A ACO 1 CI C ACC 1 CUAC ITCCACCCC AAC 


279 


BsmAI 


LvA ACU 1 CI CAUvjCOC ITCGACTACGTCAAGG 


280 


BsmAI 


UAACOVJ 1 CI CAAC AAOA i OO ACOCCjOCCTTT GAC 


281 


Bsal 


UA ACCij 1 CI CACO A 1 A 1 AA 1 1 T UOAACAAGTACCC 


282 


BsmBI 


rsA ArT'nx/'^i^Ai^A Ars/^Ar*/^/^ Ar»ATAr"rr«TA/^ 
UAACCO 1 C 1 CAGAAGCACCG ACA 1 AG 1 CI AC 


283 


BsmBI 


UAACCO 1 C I CAAAGG 1 GGG 1 C 1 ACG 1 1 CCG 


284 


BsmBI 


GAACGTCTCAGGAATCCATATTGCGGAGATTCCGGC 


285 


BsmAT 

JUfOlILTXI, 


GAACGGTCTCAGTTTATCATGTTCACGAGCTC 


286 


Bsal 


GAACCGTCTCAGAGGTAGTTGGCCGTGTACTTG 


287 


BsmBI 


GAACGTCTCAGCCATGCGTCAACGCCGATG 


288 


BsmAI 


GAACGTCTCATTGTAGTAGAGCGGGAAGTCG 


289 


BsmAI 


GAACGGTCTCAATCGGTGTCXjTGGnTGCAACG 


290 


Bsal 


GAACCGTCTCACTTCCACCGGCGAGGTGGTCGTG 


291 


BsmBI 


GAACCGTCTCACCTTCCGGCCTTGCTCGAGCC 


292 


BsmBI 


TCGAGACTGACTCTCAGCCCACCCCXjCAGTAGCTC 


293 




gaacactagtaggaggtaacttatggccaagtactcx:gagctggaag 

AGG 

OAACUVJ 1 C 1 CA 1 1 CC 1 CCCGCGAGC AAGGG 


294 


Spel 


295 


Bsal 


OAACCO 1 C 1 CAAAAC ACCGCCC ACGCC 1 A 1 G 


296 


BsmBI 


ijAAl^vj 1 1 C-ACC 1 CO AC 1 1 CCACCCGAACGAGC 


297 


BsmAI 




298 


BsmAI 


n A A pnrsTPTP A A p A A A T/^f^ A f^n A r;r5pn*rr»r; 

VJAAv-'VlvJ I ^ AAU A 1 vJOAl^VJAvJVJCC 1 1 CVJ 


299 


Bsal 


ftA APPHTPTPAPrS ATATA ATPmnA AP A An 
WAAV-'^VJ \^J\\AJJ\ A A 1 AA 1^1 OVJAA^AAU 


300 


BsmBI 


CIA APPnTPTPAHA AITPAPTOAPATPriTTTAPTAPn 
O AA V^^VJ 1 1 A VI AA VJC AC 1 Ll AC A 1 CO ill AC 1 ACG 


301 


BsmBI 


P A A PPPTPTP AAA nnTPPnTTT A P/^XTPrv^ 
VJAAL<V-.VJ IK^I \^AAAO\J 1 OVJvJ 111 ACO 1 1 CCVJ 


302 


BsmBI 


GAACGTCTCAGGAATCCATATCGCCGAAAT 




Dom A T 

osmAi 


GAACGGTCTCAGTTTATCATGTTTATGAGC 


304 


Bsal 


GAACCGTCTCAGAGGTAGTTGGCCGTGTATTTAC 


305 


BsmBI 


GAACGTCTCACGCCAGGCATCGATGCCGAT 


306 


BsmAI 


GAACGTCTCATTGTAGTAGAGGGCGAAGTCAAAG 


307 


BsmAI 


GAACGGTCTCAATCGGTATCGTGGTTGGCTACAAAC 


308 


Bsal 


GAACCGTCTCACTTCCTCCGGCGAGGTTGTCATG 


309 


BsmBI 


GAACCGTCTCACCTTCCGGCTTTGCTTGAGGC 


310 


BsmBI 


TCGAGACTGACTCTCACCCAACACCGCAGTAGCTCC 


311 




CACACAGCAGCAACCAACCTCGAGACTGACTCTCASCC 


312 


Bbvl 
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5 Conditions used for PGR were as follows: 3 min 94**C, (30 sec 94*^0; 30 sec 

55*^0, 30 sec 68**C) x 30 cycles, followed by 10 min PGR products 
corresponding to homologous regions from the three parents were pooled (1:1 :1), cut 
with the ^propriate restriction enzyme (see Table 8), and gel-purified. Equal 
amounts of fragment pools were combined and ligated (16 ^G; over night). The 

10 resulting 450 bp ligation products were gel purified and ligated to yield fiill length 
amylase genes. The resulting full length products were gel-purified and PGR 
amplified using a mixture of Fl primers SEQ ID NO.: 81, SEQ ID NO.: 77. SEQ ID 
NO.: 79 and primer (SEQ ID NO: 312). Gonditions used for PGR were as follows: 3 
min 94^G, (30 sec 94^C; 30 sec 50^G, 60 sec 68*»G) x 30 cycles, followed by 10 min 

1 5 68*'G. The resulting PGR products (--1 .4 kbp) were purified, cut with Spel and Bbvl, 
gel-purified, ligated into pMYG (vector from Mycogen, cut with Spel/Xhol), and 
transformed into E.coli ToplO. Plasmid DNA from a pool of -21000 colonies was 
isolated and transformed into Pseudomonas, 

20 Screening of Reassembled g-amvlasc 

The transformed Pseudomonas fluorescens (MB214) containing pMYG 
derived from the parent clones SEQ ID NO.: 81, SEQ ID NO.: 77, SEQ ID NO.: 79 
were sorted to 96- or 384-well plates by FAGS and treated with 6M urea. Primary 
screening using RBB-starch and/or FITG-starch as substrates was carried out as 
25 described more fully below. Elevated active clones were screened using RBB-starch 
as substrate using induced cultures and by liquefaction assay. Stock and sequencing 
new elevated active clones based on Uquefaction data was performed. 

The transformed reassembled amylase library (MB214 (Pf)), were collected 
30 and sorted into 96-well plates (or 384-well plates) at 1 cell/well in 50^1 of LB+Tet. 
The plates were incubated for 24 hours at 30''G. RepUcate plates were made 
corresponding to each well for storage. Forty-five (45) |U of 12M urea was added to 
each well and the plates were shaken for 10 minutes. Plates were kept at room temp 
for at least 1 hour and the lysate stored at 4^G. 



wo 02/068589 



PCT/US02/05068 



119 

5 

Assay Using RBB-starch 

75^1 of RBB-starch substrate (1% RBB-insoluble com starch in 50mM NaAc 
buffer, pH^.5) was added into each well of a new 96-well plate (V-bottom), Five 
micro-liters of enzyme lysate was transfered into each well with substrate using 
Biomek or Zymark. The plates were sealed with aluminum sealing tape and shaken 
briefly on the shaker. The plates were incubated at 90°C for 30 minutes, followed by 
cooling at room temperature for about 5 to 10 minutes. One hundred micro-liters of 
100% ethanol was added to each well, the plates sealed and shaken briefly on the 
shaker. The plates were then centrifuged 4000rpm for 20 minutes using bench-top 
centrifuge. lOOjil of the supernatant was transferred into a new 96-well plate (flat 
bottom) by Biomek and read OD59S. Controls: SEQ ID NO.: 81, SEQ ID NO.: 77, 
SEQ ID NO.: 79. 

Assay using FITC-starch 

Added 50^1 of substrate (0.01% FITC-starch in lOOmM NaAc buffer, pH=4.5) 
into each well of a new 384-well plate. Transfered 5\il of enzyme lysate into each 
well with substrate and incubated the plate at room temperature overnight. The 
polarization change of the substrate, excitation 48Snm, emission S35nm, was read for 
each well. Controls: SEQ ID NO,: 81, SEQ ID NO.: 77. SEQ ID NO.: 79. Preferably 
96 well plates are used for all assays. 

Confirmation of new active clones 

Each positive clone from screening was grown and induced using a 
standard protocol. Each clone was examined for growth (i.e., cell density over time), 
activity at per cell level (RBB-starch assay and liquefaction assay), expression 
(protein gel) and solubility of protein (by microscope analysis). The confirmed new 
elevated clones were transferred for fermentation. 
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While the invention has been described in detail with reference to certain 
preferred embodiments thereof, it will be understood that modifications and variations 
are within the spirit and scope of that which is described and claimed. 
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Table 3 



SEQ ID NO. Signal Sequence 

SEQ ID NO: 87 AA1-23 (SEQ ID NO:213) 

SEQ ID NO: 91 AA1-23 (SEQ ID NO: 214) 

SEQ ID NO: 93 AA1-33 (SEQ ID NO: 215) 

SEQ ID NO: 97 AAI-SI (SEQ ID NO: 216) 

SEQ ID NO: 99 AAI-SO (SEQ ID NO: 217) 

SEQ ID NO: 103 AA1-22 (SEQ ID NO: 218) 

SEQ ID NO: 105 AA1-33 (SEQ ID NO: 219) 

SEQ ID NO: 109 AA1-25 (SEQ ID NO: 220) 

SEQ ID NO: 111 AA1-35 (SEQ ID NO: 221) 

SEQ ID NO: 1 13 AA1-28 (SEQ ID NO: 222) 

SEQ ID NO: 1 17 AA1-21 (SEQ ID NO: 223) 

SEQ ID NO: 1 19 AA1-30 (SEQ ID NO: 224) 

SEQ ID NO: 123 AA1-35 (SEQ ID NO: 225) 

SEQ ID NO: 125 AA1-28 (SEQ ID NO: 226) 

SEQ ID NO: 127 AA1-30 (SEQ ID NO: 227) 

SEQ ID NO: 131 AA1-30 (SEQ ID NO: 228) 

SEQ ID NO: 133 AA1-30 (SEQ ID NO: 229) 

SEQ ID NO: 137 AA1-28 (SEQ ID NO: 230) 

SEQ ID NO: 139 AA1-23 (SEQ ID NO: 231) 

SEQ ID NO: 141 AA1-23 (SEQ ID NO: 232) 

SEQ ID NO: 143 AA1-30 (SEQ ID NO: 233) 

SEQ ID NO: 145 AA1-27 (SEQ ID NO: 234) 

SEQ ID NO: 147 AA1-29 (SEQ ID NO: 235) 

SEQ ID NO: 149 AA1-28 (SEQ ID NO: 236) 

SEQ ID NO: 69 Ml -27 (SEQ ID NO: 237) 

SEQ ID NO: 153 AA1-26 (SEQ ID NO: 238) 

SEQ ID NO: 155 AA1-33 (SEQ ID NO: 239) 

SEQ ID NO: 157 AA1-25 (SEQ ID NO: 240) 

SEQ ID NO: 159 AA1-25 (SEQ ID NO: 241) 

SEQ I D NO: 1 61 AA1 -36 (SEQ ID NO: 242) 

SEQ ID NO: 167 AA1-36 (SEQ ID NO: 243) 

SEQ ID NO: 169 AA1-23 (SEQ ID NO: 244) 

SEQ ID NO: 173 AA1.25 (SEQ ID NO: 245) 

SEQ ID NO: 175 AA1-22 (SEQ ID NO: 246) 

SEQ ID NO: 177 AA1-23 (SEQ ID NO: 247) 

SEQ ID NO: 179 AA1-23 (SEQ ID NO: 248) 

SEQ ID NO: 185 AA1*25 (SEQ ID NO: 249) 

SEQ ID NO: 189 AA1.36 (SEQ ID NO: 250) 

SEQ ID NO: 191 AA1-25 (SEQ ID NO: 251) 

SEQ ID NO: 193 AA1.25 (SEQ ID NO: 252) 

SEQ ID NO: 197 AA1-23 (SEQ ID NO: 253) 

SEQ ID NO: 199 AA1-23 (SEQ ID NO: 254) 

SEQ ID NO: 201 AA1 -30 (SEQ ID NO: 255) 

SEQ ID NO: 203 AA1-25 (SEQ ID NO: 256) 

SEQ ID NO: 205 AA1-16 (SEQ ID NO: 257) 

SEQ ID NO.:73 AA1-16 (SEQ ID NO: 7) 

SEQ ID NO.: 79 AA1-26 (SEQ ID NO: 8) 
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5 WHAT IS CLAIMED IS; 

1 . An isolated nucleic acid comprising a sequence that encodes a polypq)tide 
having alpha amylase activity, wherein said sequence is selected from the group 
consisting of: 

SEQ ID NOS: 1, 3, 5, 9. 11, 13, 15. 17, 19. 21. 23, 25, 27, 29, 31, 33, 35, 37. 

10 39, 41, 43, 45 ,47, 49. 51, 53, 55, 57, 59, 61, 63, 65, 67. 69, 71, 73, 75, 77, 79, 81, 83, 
85, 87,89.91,93,95,97, 99, 101, 103, 105, 107. 109, 111, 113, 115, 117, 119, 121, 
123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
157, 159, 161, 163. 165, 167. 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 
191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211. 213. 215. 217. 219. 221. 223, 

15 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 
259, 261, 263, 265, 267, 269. 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 
293, 295, 297, 299; 

variants having at least about 50% homology to at least one of SEQ ID NOS: 
1,3,5, 9, 11, 13, 15. 17, 19,21,23,25,27, 29,31,33,35,37, 39,41,43,45,47,49, 

20 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 
97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 
131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 
165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 
199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 

25 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 
267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 
over a region of at least about 100 residues, as determined by analysis with a sequence 
comparison algorithm or by visual inspection; 

sequences complementary to SEQ ID NOS: 1, 3, 5, 9. 1 1, 13, 15. 17, 19, 21, 

30 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 
111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 
145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 
179. 181, 183, 185, 187, 189, 191, 193, 195, 197, 199,201,203,205,207,209,211, 

35 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 
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5 247, 249, 251, 253, 255, 257, 259, 261, 263. 265, 267. 269, 271, 273, 275, 277, 279, 
281, 283, 285, 287, 289, 291, 293, 295, 297, 299; and 

sequences complementary to variants having at least about 50% homology to 
SEQ n>NOS: 1, 3. 5, 9, 1 1. 13, 15, 17, 19, 21, 23, 25, 27, 29, 3 1, 33, 35, 37, 39, 41, 
43, 45 ,47, 49, 51, 53, 55, 57, 59. 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 

10 89,91,93,95,97,99, 101, 103, 105, 107, 109, 111. 113, 115, 117, 119, 121, 123, 
125. 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149. 151, 153, 155, 157. 
159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 
193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 
227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 

15 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 
295, 297, 299, over a region of at least about 100 residues, as determined by analysis 
with a sequence comparison algorithm or by visual inspection. 

2. The isolated nucleic acid as claimed in claim 1, wherein the isolated nucleic acid 

20 comprises a complementary sequence that hybridizes under conditions of high 
stringency to a sequence selected fiom the group consisting of: SEQ ID NOS: 1, 
3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33. 35, 37, 39. 41, 43. 45 ,47, 49, 
51, 53, 55, 57. 59, 61, 63, 65, 67, 69, 71, 73. 75, 77. 79. 81, 83. 85, 87, 89. 91, 93, 
95,97, 99, 101, 103, 105, 107, 109, 111, 113, 115. 117, 119. 121. 123, 125, 127, 

25 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 
161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 
193, 195, 197, 199, 201, 203, 205, 207, 209, 211. 213, 215, 217, 219, 221, 223, 
225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 
257, 259, 261, 263, 265, 267, 269, 271, 273, 275. 277, 279, 281, 283, 285, 287, 

30 289, 291 . 293, 295, 297, 299, and variants having at least about 50% homology to 
at least one of SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 
77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135. 137, 139, 141, 143, 145, 

35 147, 149, 151, 153, 155, 157, 159, 161, 163, 165. 167, 169, 171, 173, 175, 177, 
179, 181, 183, 185. 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 
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5 211, 213, 215, 217, 219, 221. 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 
243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 
275, 277, 279, 281, 283, 285, 287. 289, 291, 293, 295, 297, 299 over a region of at 
least about 100 residues, as determined by analysis with a sequence comparison 
algorithm or by visual inspection. 

10 

3. The isolated nucleic acid as claimed in claim 1, wherein the isolated nucleic acid 
comprises a complementary sequence that hybridizes under conditions of 
moderate stringency to a sequence selected from the group consisting of: SEQ ID 
NOS: 1. 3, 5, 9, 1 1, 13. 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 

15 ,47, 49, 51, 53, 55, 57. 59, 61, 63, 65, 67, 69. 71, 73. 75, 77, 79, 81, 83, 85, 87, 89, 
91,93,95,97,99, 101, 103, 105, 107. 109, 111, 113, 115, 117, 119, 121, 123, 
125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147. 149, 151, 153, 155, 
157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179. 181, 183, 185, 187, 
189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 

20 221, 223, 225, 227. 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 
253, 255. 257, 259, 261, 263, 265. 267, 269, 271, 273, 275, 277, 279, 281, 283. 
285, 287, 289, 291, 293, 295, 297, 299, and variants having at least about 50% 
homology to at least one of SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, 33. 35, 37, 39. 41, 43, 45 ,47. 49, 51, 53, 55, 57. 59, 61, 63. 65, 67, 69, 

25 71. 73, 75, 77, 79. 81. 83. 85. 87, 89. 91, 93, 95. 97. 99, 101, 103. 105, 107. 109. 
111. 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141. 
143, 145, 147. 149. 151. 153. 155, 157, 159, 161, 163. 165, 167, 169. 171, 173, 
175, 177, 179, 181. 183, 185. 187. 189. 191. 193, 195, 197. 199,201,203.205, 
207, 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 

30 239, 241, 243. 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265. 267. 269, 
271, 273, 275, 277, 279, 281. 283. 285, 287, 289. 291, 293, 295, 297, 299 over a 
region of at least about 100 residues, as determined by analysis with a sequence 
conq)arison algorithm or by visual inspection. 

35 4. The isolated nucleic acid as claimed in claim 1, wherein the isolated nucleic acid 
comprises a complementary sequence that hybridizes under conditions of low 
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5 stringency to a sequence selected from the group consisting of: SEQ ID NOS: 1 , 
3, 5, 9, 11, 13, 15, 17. 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 
51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 
95.97,99, 101. 103. 105, 107, 109. Ill, 113, 115, 117. 119. 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 

10 161, 163, 165. 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 
193, 195. 197, 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221. 223, 
225, 227. 229. 231. 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 
257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 
289, 291, 293, 295, 297, 299, and variants having at least about 50% honaology to 

15 at least one of SEQ ID NOS: 1, 3, 5, 9, 11, 13, 15, 17. 19, 21, 23, 25, 27, 29, 31. 

33, 35, 37. 39, 41. 43, 45 .47, 49. 51. 53. 55, 57, 59, 61, 63, 65, 67. 69, 71, 73, 75, 
77, 79. 81, 83, 85, 87, 89, 91, 93, 95, 97, 99. 101. 103, 105, 107, 109, 111, 113, 
115, 117. 119, 121, 123, 125. 127, 129, 131. 133. 135, 137, 139, 141, 143, 145, 
147, 149. 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 

20 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 
21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231. 233, 235, 237, 239, 241, 
243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271. 273, 
275, 277, 279, 281, 283, 285, 287, 289. 291, 293, 295, 297, 299 over a region of at 
least about 100 residues, as detennined by analysis with a sequence comparison 

25 algorithm or by visual inspection. 

5. The isolated nucleic acid as claimed in claim 1 , wherein said variants have at least 
about 50% homology to at least one of SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 
21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 

30 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85. 87, 89. 91, 93, 95, 97, 99, 101, 103, 105, 
107, 109, 111. 113, 115, 117. 119. 121, 123, 125, 127, 129, 131, 133, 135, 137, 
139, 141, 143, 145, 147, 149. 151, 153, 155, 157, 159, 161, 163. 165, 167, 169, 
171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191. 193, 195, 197, 199, 201, 
203, 205, 207. 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231. 233. 

35 235, 237, 239. 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 
267, 269. 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 
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S 299, over a region of at least about 200 residues, as detennined by analysis with a 
sequence comparison algorithm. 

6. The isolated nucleic acid according to claim 1, wherein said variants have at least 
about 50% homology to at least one of SEQ ID NOS: 1 , 3, 5, 9, 1 1 , 1 3, 1 5, 1 7, 1 9, 

10 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41,43. 45 ,47, 49. 51, 53, 55, 57. 59, 61, 63, 
65, 67. 69, 71, 73. 75, 77, 79, 81. 83, 85, 87. 89, 91. 93, 95. 97. 99, 101, 103, 105, 
107, 109, 111, 113, 115. 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137. 
139, 141, 143, 145, 147, 149, 151. 153, 155, 157, 159, 161, 163, 165, 167, 169, 
171, 173, 175, 177, 179, 181. 183. 185, 187, 189, 191, 193, 195, 197, 199, 201. 

15 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 227. 229. 231, 233, 
235, 237, 239, 241, 243, 245, 247, 249, 251. 253. 255. 257. 259, 261, 263, 265, 
267, 269, 271. 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297. 
299 over the entire sequence. 

20 7. The isolated nucleic acid according to claim 1 , 2, 3. 4. 5 or 6, wherein said 

variants have at least about 55% homology to at least one of SEQ ID NOS: 1, 3. 5, 
9. 11, 13, 15, 17, 19, 21. 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 .47, 49, 51, 
53, 55. 57. 59, 61, 63, 65, 67, 69, 71. 73. 75, 77, 79. 81, 83, 85, 87, 89, 91, 93, 95. 
97, 99, 101. 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 

25 131. 133, 135. 137, 139, 141, 143, 145, 147. 149, 151, 153, 155, 157, 159, 161, 
163, 165, 167. 169, 171, 173, 175, 177, 179, 181, 183. 185. 187. 189, 191, 193, 
195, 197, 199, 201, 203, 205. 207, 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 
227. 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 
259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 

30 291,293,295,297,299. 

8. The isolated nucleic acid according to claim 1, 2, 3, 4, 5 or 6, wherein said 

variants have at least about 60% homology to at least one of SEQ ID NOS: 1, 3, 5, 
9, 11, 13, 15, 17, 19,21,23,25,27,29,31,33, 35, 37, 39,41,43,45.47,49,51. 
35 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85. 87. 89, 91, 93, 95, 
97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 
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5 131, 133, 135, 137. 139, 141, 143. 145. 147, 149, 151, 153, 155, 157, 159. 161, 
163. 165. 167. 169. 171. 173. 175. 177, 179. 181, 183, 185. 187, 189. 191. 193, 
195, 197. 199, 201. 203, 205. 207, 209. 21 1, 213, 215, 217, 219, 221, 223, 225, 
227, 229, 231. 233, 235, 237, 239. 241, 243. 245, 247, 249. 251, 253. 255. 257. 
259. 261, 263, 265, 267, 269, 271. 273, 275, 277, 279, 281. 283. 285, 287. 289. 
10 291,293,295.297,299. 

9. The isolated nucleic acid according to claim 1 . 2, 3. 4. 5 or 6. wherein said 
variants have at least about 65% homology to at least one of SEQ ID NOS: 1, 3, 5. 
9, 1 1, 13, 15. 17, 19. 21. 23, 25, 27, 29, 31, 33. 35, 37, 39, 41, 43. 45 ,47, 49, 51, 

15 53, 55, 57, 59. 61, 63, 65, 67, 69. 71, 73, 75, 77, 79. 81, 83, 85. 87, 89, 91, 93. 95, 
97. 99. 101, 103, 105, 107, 109, 111. 113, 115, 117, 119, 121, 123. 125, 127, 129, 
131, 133. 135. 137, 139. 141, 143, 145, 147, 149, 151. 153, 155. 157, 159. 161, 
163. 165, 167, 169, 171, 173. 175, 177, 179, 181, 183, 185, 187. 189. 191. 193. 
195. 197, 199, 201. 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 223. 225, 

20 227, 229, 231. 233. 235. 237, 239, 241, 243, 245. 247, 249. 251. 253. 255. 257. 
259. 261, 263, 265, 267. 269. 271. 273. 275. 277, 279, 281, 283, 285, 287, 289, 
291,293,295,297,299. 

10. The isolated nucleic acid according to claim 1, 2, 3, 4, 5 or 6, wherein said 

25 variants have at least about 70% homology to at least one of SEQ ID NOS: 1 , 3, 5. 
9, 11, 13, 15, 17, 19. 21. 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47. 49. 51, 
53, 55, 57, 59, 61, 63. 65. 67, 69. 71, 73. 75. 77, 79. 81. 83. 85, 87, 89, 91, 93, 95. 
97,99, 101. 103, 105. 107. 109. Ill, 113, 115, 117, 119, 121, 123, 125, 127, 129, 
131, 133. 135. 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 

30 163, 165, 167, 169. 171. 173, 175, 177, 179. 181. 183, 185, 187. 189. 191, 193, 
195, 197. 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 223, 225. 
227, 229. 231, 233, 235, 237, 239, 241, 243, 245, 247, 249. 251. 253, 255, 257, 
259, 261, 263, 265, 267, 269. 271, 273, 275, 277, 279, 281. 283. 285. 287, 289. 
291,293,295, 297,299. 
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S 11. The isolated nucleic acid according to claim 1,2,3, 4,5 or 6, wherein said 

variants have at least about 75% homology to at least one of SEQ ID NOS: 1,3,5, 
9, 11. 13, 15. 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 
53, 55, 57. 59, 61, 63. 65, 67. 69. 71. 73, 75, 77. 79, 81, 83. 85, 87, 89, 91, 93, 95, 
97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 

10 131, 133, 135, 137, 139, 141. 143. 145, 147, 149, 151, 153, 155, 157, 159, 161, 
163, 165. 167, 169, 171, 173, 175. 177. 179, 181. 183, 185, 187, 189, 191, 193, 
195, 197. 199. 201, 203. 205, 207, 209. 211. 213, 215, 217, 219, 221, 223, 225, 
227, 229. 231. 233, 235, 237, 239, 241, 243. 245, 247, 249. 251. 253, 255, 257, 
259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 

15 291,293,295,297,299. 

12. The isolated nucleic acid according to claim 1, 2, 3. 4. 5 or 6. wherein said 

variants have at least about 80% homology to at least one of SEQ ID NOS: 1. 3, 5, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 

20 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91. 93, 95, 
97, 99, 101, 103, 105, 107. 109, 1 1 1, 1 13, 1 15, 1 17, 1 19, 121, 123, 125, 127, 129, 
131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159. 161, 
163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 
195, 197. 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 

25 227, 229, 231. 233, 235. 237, 239, 241, 243, 245, 247, 249. 251. 253, 255, 257. 
259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 
291,293,295,297,299. 



13. The isolated nucleic acid according to claim 1, 2, 3, 4, 5 or 6, wherein said 
30 variants have at least about 85% homology to at least one of SEQ ID NOS: 1, 3, 5, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47,49. 51. 
53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 
97,99, 101, 103, 105, 107, 109. 111. 113. 115, 117, 119, 121, 123, 125, 127, 129, 
131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 
35 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189. 191, 193, 
195. 197. 199. 201. 203. 205. 207. 209, 211, 213, 215. 217, 219, 221, 223, 225, 
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5 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 
259. 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 
291,293,295,297,299. 

14. The isolated nucleic acid according to claim 1, 2, 3, 4, 5 or 6, wherein said 

10 variants have at least about 90% homology to at least one of SEQ ID NOS: 1 . 3, 5, 
9, 11, 13, 15, 17. 19, 21, 23. 25, 27, 29. 31. 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 
53. 55, 57, 59, 61. 63, 65, 67, 69, 71. 73. 75. 77, 79, 81, 83, 85, 87, 89. 91, 93, 95, 
97,99, 101, 103. 105, 107, 109. Ill, 113, 115, 117, 119, 121, 123, 125, 127. 129, 
131, 133, 135. 137, 139, 141. 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 

15 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183. 185, 187, 189, 191, 193. 
195, 197, 199, 201, 203, 205, 207, 209, 211. 213, 215, 217, 219, 221, 223, 225, 
227, 229, 231. 233, 235, 237, 239, 241. 243, 245. 247. 249, 251, 253, 255, 257, 
259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 
291,293.295, 297,299. 

20 

15. The isolated nucleic acid according to claim 1, 2, 3, 4, 5 or 6, wherein said 
variants have at least about 95% homology to at least one of SEQ ID NOS: 1, 3, 5, 
9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 
53, 55, 57, 59, 61, 63, 65, 67, 69. 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 

25 97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 
131. 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 
163, 165, 167, 169, 171, 173, 175, 177, 179, 181. 183, 185, 187, 189, 191, 193, 
195, 197, 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 
227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 

30 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287. 289, 
291,293. 295.297.299. 

16. The isolated nucleic acid of claim 1, wherein the sequence comparison algorithm 
is FASTA version 3.0t78 with the defauh parameters. 

35 



wo 02/068589 



PCT/US02/05068 



130 

S 1 7. An isolated nucleic acid comprising at least 10 consecutive bases of a sequence 
selected firom the group consisting of: 

SEQ ID NOS: 1, 3, 5, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 
35. 37. 39. 41, 43. 45 .47. 49. 51, 53. 55. 57. 59, 61. 63, 65, 67. 69, 71, 73, 75, 77. 79. 
81.83, 85,87,89,91,93.95,97.99. 101, 103, 105, 107, 109, 111, 113. 115. 117, 
10 119, 121. 123, 125, 127, 129, 131, 133, 135. 137. 139. 141, 143. 145, 147, 149, 151, 
153. 155. 157, 159. 161, 163, 165, 167, 169, 171, 173, 175, 177. 179. 181, 183, 185, 
187, 189, 191. 193, 195, 197. 199, 201. 203. 205. 207. 209, 211, 213. 215. 217, 219. 
221, 223, 225. 227, 229, 231. 233, 235, 237, 239, 241, 243, 245. 247. 249, 251, 253, 
255, 257, 259, 261. 263. 265, 267, 269. 271, 273. 275. 277, 279. 281, 283, 285, 287, 
15 289,291,293.295.297.299; 

variants having at least about 50% homology to at least one of SEQ ID 
NOS: 1, 3. 5, 9, 1 1. 13, 15, 17, 19, 21. 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 
,47, 49, 51, 53, 55, 57. 59. 61, 63, 65, 67. 69. 71. 73. 75. 77, 79, 81, 83, 85. 87, 89. 
91.93.95.97,99. 101. 103. 105, 107. 109, 111. 113. 115, 117, 119, 121, 123, 
20 125, 127. 129, 131. 133, 135, 137. 139. 141, 143. 145, 147, 149. 151, 153, 155, 
157, 159, 161, 163, 165, 167. 169. 171, 173, 175. 177, 179, 181. 183, 185, 187, 
189. 191, 193. 195, 197, 199. 201, 203. 205, 207. 209, 21 1, 213, 215, 217. 219. 
221. 223, 225. 227. 229. 231. 233, 235. 237. 239. 241. 243. 245, 247, 249. 251. 
253. 255, 257, 259. 261, 263, 265, 267. 269, 271, 273, 275, 277, 279. 281, 283, 
25 285, 287. 289, 291 , 293, 295. 297, 299 over a region of at least about 100 

residues, as determined by analysis with a sequence comparison algorithm or by 
visual inspection; 

sequences complementary to SEQ ID NOS: 1, 3, 5, 9, 1 1 , 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41. 43, 45 .47, 49, 51, 53, 55, 57, 59, 61, 
30 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89. 91, 93, 95. 97. 99, 101. 103, 
105, 107, 109, 111. 113. 115, 117, 119. 121. 123. 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167. 
169, 171, 173. 175. 177. 179, 181, 183, 185, 187, 189, 191, 193, 195. 197, 199, 
201, 203, 205, 207, 209, 21 1, 213. 215, 217, 219, 221. 223. 225, 227, 229. 231. 
35 233. 235, 237, 239, 241. 243, 245, 247, 249, 251, 253, 255, 257, 259, 261. 263. 
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5 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 
297, 299; and 

sequences complementary to variants having at least about 50% 
homology to SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17. 19, 21, 23, 25, 27, 29, 31, 33, 
35, 37. 39, 41, 43. 45 ,47, 49. 51, 53, 55. 57. 59. 61, 63, 65. 67. 69. 71. 73, 75, 77, 
10 79, 81, 83, 85, 87, 89. 91. 93, 95. 97, 99. 101. 103. 105. 107. 109, 111. 1 13. 1 15, 
117. 119, 121, 123, 125. 127. 129. 131. 133, 135, 137, 139, 141, 143, 145, 147, 
149, 151, 153. 155. 157, 159, 161, 163, 165. 167, 169, 171. 173, 175. 177, 179, 
181, 183, 185, 187. 189, 191. 193, 195, 197, 199, 201, 203, 205, 207, 209, 21 1, 
213, 215, 217. 219, 221, 223, 225, 227. 229, 231. 233, 235. 237, 239, 241, 243, 
15 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 
277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299 over a region of at 
least about 100 residues, as determined by analysis with a sequence comparison 
algorithm or by visual inspection. 

18. The isolated nucleic acid as claimed in claim 17, wherein said sequence has at 
least about 50% homology to a sequence selected from the group consisting of 
SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27. 29. 31. 33. 35, 37. 39, 
41, 43, 45 ,47, 49, 51, 53. 55. 57. 59. 61, 63, 65, 67, 69, 71. 73, 75, 77. 79. 81. 83. 
85, 87, 89, 91, 93, 95, 97. 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 1 19, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 
153, 155, 157, 159, 161, 163, 165. 167, 169, 171, 173, 175, 177, 179, 181, 183, 
185, 187, 189. 191. 193, 195, 197, 199. 201, 203, 205, 207, 209, 211, 213, 215, 
217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263. 265, 267, 269, 271, 273, 275, 277. 279. 
281. 283, 285, 287, 289, 291, 293, 295. 297, 299, over a region of at least about 
200 residues. 

19. The isolated nucleic acid as claimed in claim 17, wherein said sequence has at 
least about 50% homology to a sequence selected from the group consisting of 

35 SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23. 25, 27, 29, 31, 33, 35, 37, 39, 
41. 43. 45 .47, 49, 51, 53. 55. 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 



20 



25 



30 
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5 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 1 19, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 
153, 155, 157, 159, 161, 163, 165, 167, 169. 171, 173, 175. 177, 179, 181, 183, 
185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207. 209, 211. 213, 215, 
217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
10 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 
281, 283, 285, 287, 289, 291, 293, 295, 297, 299, over the entire sequence. 

20. The isolated nucleic acid as claimed in claim 17, 18 or 19, wherein said sequence 
has at least about 55% homology to a sequence selected from the group consisting 

15 of SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17. 19. 21. 23, 25, 27, 29, 31, 33, 35, 37, 
39. 41, 43, 45 .47, 49, 51, 53, 55, 57, 59, 61, 63, 65. 67, 69, 71, 73, 75, 77, 79, 81, 
83, 85. 87, 89, 91, 93, 95. 97, 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 
119, 121, 123, 125, 127, 129, 131, 133. 135, 137, 139, 141, 143, 145, 147, 149, 
151, 153, 155, 157, 159, 161, 163, 165. 167. 169, 171, 173, 175, 177, 179, 181, 

20 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 
215, 217, 219, 221, 223, 225, 227, 229, 231. 233. 235. 237. 239, 241, 243, 245, 
247, 249, 251, 253, 255, 257. 259, 261, 263. 265. 267. 269. 271, 273, 275, 277, 
279, 281, 283. 285, 287, 289, 291, 293, 295, 297, 299. 

25 21. The isolated nucleic acid as claimed in claim 17, 18 or 19, wherein said sequence 
has at least about 60% homology to a sequence selected from the group consisting 
of SEQ ID NOS: 1, 3, 5, 9. 1 1. 13. 15, 17, 19, 21, 23, 25. 27. 29. 31, 33. 35. 37, 
39, 41, 43, 45 ,47, 49, 51, 53. 55, 57, 59, 61, 63, 65. 67. 69. 71, 73, 75, 77, 79, 81, 
83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 

30 119, 121, 123, 125, 127, 129. 131, 133, 135. 137, 139, 141, 143, 145, 147, 149, 
151, 153, 155, 157, 159, 161, 163, 165. 167, 169, 171, 173, 175, 177, 179, 181, 
183, 185, 187, 189, 191, 193, 195, 197, 199.201,203,205.207.209.211.213, 
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 
247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 

35 279, 28 1 , 283, 285, 287, 289. 29 1 , 293, 295, 297, 299. 
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S 22. The isolated nucleic acid as claimed in claim 17, 18 or 19, wherein said sequence 
has at least about 65% homology to a sequence selected from the group consisting 
of SEQ IDNOS: 1, 3, 5, 9, 11, 13. 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35. 37, 
39. 41, 43, 45 ,47, 49. 51, 53, 55, 57, 59. 61. 63, 65, 67, 69, 71, 73, 75, 77, 79, 81. 
83, 85, 87. 89. 91. 93. 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 113. 115. 117, 

10 119, 121. 123. 125, 127, 129, 131, 133, 135, 137, 139. 141, 143, 145, 147, 149, 
151, 153. 155. 157, 159. 161. 163. 165, 167, 169. 171, 173, 175, 177, 179, 181, 
183. 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207. 209, 21 1. 213. 
215, 217. 219. 221. 223, 225, 227, 229, 231, 233, 235, 237. 239, 241, 243, 245, 
247, 249, 251, 253, 255, 257, 259, 261, 263. 265, 267. 269, 271, 273, 275, 277, 

15 279,281,283.285.287.289.291.293,295,297.299. 

23. The isolated nucleic acid as claimed in claim 17, 18 or 19. wherein said sequence 
has at least about 70% homology to a sequence selected from the group consisting 
of SEQ ID NOS: 1, 3. 5, 9. 1 1. 13, 15, 17. 19. 21. 23, 25, 27, 29, 31, 33. 35, 37, 

20 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65. 67, 69, 71, 73, 75. 77, 79. 81. 
83. 85, 87. 89. 91, 93, 95, 97, 99, 101, 103, 105, 107, 109. 111. 1 13, 115. 117, 
119, 121, 123. 125, 127. 129. 131. 133. 135, 137. 139. 141, 143, 145, 147. 149. 
151. 153. 155, 157. 159. 161, 163, 165, 167, 169, 171, 173. 175. 177, 179. 181, 
183, 185, 187, 189, 191. 193, 195, 197, 199, 201, 203. 205, 207, 209, 21 1. 213. 

25 215, 217. 219, 221. 223. 225. 227, 229, 23 1, 233, 235. 237. 239. 241, 243. 245. 
247. 249. 251. 253. 255. 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299. 

24. The isolated nucleic acid as claimed in claim 17, 18 or 19, wherein said sequence 
30 has at least about 75% homology to a sequence selected from the group consisting 

of SEQ ID NOS: 1. 3. 5, 9, 1 1, 13, 15, 17, 19, 21. 23, 25, 27, 29, 31, 33, 35, 37, 
39, 41, 43, 45 .47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75. 77, 79, 81. 
83, 85, 87, 89. 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 1 13, 115, 117, 
119. 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141. 143, 145, 147. 149. 
35 151, 153, 155. 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177. 179, 181. 
183. 185, 187, 189, 191. 193. 195. 197. 199, 201. 203. 205. 207. 209, 211. 213, 
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5 215. 217, 219, 221, 223. 225, 227, 229. 231. 233, 235. 237. 239, 241. 243. 245, 
247. 249, 251, 253. 255. 257, 259. 261. 263. 265, 267. 269, 271, 273, 275. 277. 
279. 281. 283. 285, 287, 289, 291, 293, 295, 297, 299. 

25. Hie isolated nucleic acid as claimed in claim 17, 18 or 19, wherein said sequence 
10 has at least about 80% homology to a sequence selected from the group consisting 

of SEQ ID NOS: 1, 3. 5. 9, 1 1, 13, 15, 17. 19, 21, 23, 25, 27, 29. 31, 33, 35. 37. 
39, 41, 43. 45 .47. 49. 51. 53. 55. 57, 59. 61, 63, 65. 67, 69. 71. 73. 75, 77. 79. 81. 
83. 85, 87. 89. 91. 93. 95, 97. 99. 101. 103, 105. 107. 109, 1 1 1. 113, 1 15, 1 17, 
119, 121, 123. 125. 127. 129, 131, 133. 135. 137, 139. 141, 143, 145, 147, 149, 
15 151. 153, 155, 157, 159, 161, 163, 165, 167, 169, 171. 173, 175, 177, 179, 181, 
183, 185, 187, 189, 191, 193, 195, 197. 199, 201. 203. 205, 207, 209, 211, 213, 
215. 217, 219, 221, 223. 225. 227. 229. 231, 233, 235. 237, 239, 241, 243. 245, 
247, 249, 251. 253. 255, 257, 259, 261, 263, 265. 267, 269, 271, 273. 275. 277. 
279, 281, 283, 285, 287, 289, 291. 293. 295, 297, 299. 

20 

26. The isolated nucleic acid as claimed in claim 17, 18 or 19, wherein said sequence 
has at least about 85% homology to a sequence selected from the group consisting 
ofSEQIDNOS: 1,3. 5. 9.11.13. 15.17, 19.21,23,25,27, 29,31,33,35, 37, 
39, 41, 43. 45 ,47, 49. 51. 53. 55. 57. 59, 61, 63, 65, 67, 69. 71, 73, 75, 77, 79, 81, 

25 83. 85. 87, 89, 91, 93,95, 97, 99, 101. 103, 105. 107. 109. 111. 113. 115, 117. 

119, 121. 123. 125. 127, 129, 131, 133, 135, 137. 139, 141, 143, 145. 147. 149. 

151. 153. 155, 157, 159, 161. 163. 165, 167, 169, 171, 173, 175, 177. 179. 181. 

183. 185. 187, 189. 191, 193, 195, 197, 199, 201. 203, 205. 207. 209. 211. 213. 

215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241. 243, 245, 
30 247, 249, 251, 253. 255, 257. 259. 261. 263. 265, 267, 269. 271. 273. 275. 277. 

279, 281, 283, 285, 287, 289, 291. 293, 295. 297. 299. 

27. The isolated nucleic acid as claimed in claim 17. 18 or 19, wherein said sequence 
has at least about 90% homology to a sequence selected from the group consisting 

35 of SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 
39, 41. 43. 45 ,47, 49. 51. 53. 55. 57, 59. 61. 63. 65. 67, 69, 71. 73. 75. 77. 79. 81. 
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5 83, 85, 87, 89, 91, 93. 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 
119, 121, 123, 125, 127, 129. 131, 133, 135. 137, 139, 141, 143, 145, 147, 149, 
151. 153, 155, 157, 159, 161. 163. 165, 167, 169, 171, 173, 175. 177, 179, 181, 
183. 185, 187, 189, 191, 193. 195, 197, 199, 201, 203, 205, 207, 209, 21 1, 213, 
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 
10 247, 249, 251, 253, 255, 257, 259, 261, 263. 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299. 

28. The isolated nucleic acid as claimed in claim 17, 18 or 19, wherein said sequence 
has at least about 95% homology to a sequence selected from the group consisting 

15 ofSEQIDNOS: 1,3,5,9, 11, 13, 15, 17, 19,21,23,25,27,29,31,33,35,37, 
39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 
83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 
151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 

20 183, 185, 187, 189, 191, 193, 195, 197, 199,201,203,205,207,209,211,213, 
215, 217, 219, 221, 223, 225, 227. 229, 231, 233, 235, 237, 239, 241, 243, 245, 
247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287. 289, 291. 293. 295, 297, 299. 

25 29. An isolated nucleic acid encoding a polypeptide selected from the group 
consisting of: 

polypeptides having an amino acid sequence selected from the group consisting 

of: 

SEQ ID NOS: 2, 4, 6, 10, 12. 14, 16. 18, 20, 22. 24, 26, 28, 30, 32, 34, 
30 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 

78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 
114, 116. 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 
144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 
174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,202,204, 
35 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 

236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256. 258. 260, 262, 264, 
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5 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 

296, 298; 

variants having at least about S0% homology to at least one of SEQ ID 
NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34. 36, 38, 40, 42, 
44. 46. 48, 50, 52, 54, 56, 58, 60, 62, 64. 66, 68. 70, 72, 74, 76, 78, 80, 82, 84, 
10 86,88,90,92,94,96,98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118. 

120, 122, 124. 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 
150, 152. 154. 156, 158, 160, 162, 164, 166. 168, 170. 172. 174, 176, 178, 
180, 182, 184, 186. 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 
212, 214, 216, 218, 220, 222, 224, 226, 228, 230. 232, 234. 236. 238, 240, 
15 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 

272, 274, 276. 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, over a 
region of at least about 100 residues, as determined by analysis with a 
sequence comparison algorithm or by visual inspection; 

sequences complementaiy to SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 
20 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48. 50. 52, 54, 56, 58, 60, 62, 64. 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 
108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 
142, 144, 146, 148, 150. 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 
176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 
25 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244. 
246. 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298; and 

sequences complementary to variants having at least about 50% 
homology to SEQ ED NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 
30 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58. 60, 62, 64, 66, 68, 70, 72, 

74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106. 108, 1 10, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 
142, 144, 146, 148, 150. 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 
35 204, 206, 208. 210, 212, 214, 216, 218, 220, 222, 224, 226, 228. 230, 232, 

234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
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5 264, 266, 268, 270. 272. 274, 276. 278, 280, 282, 284. 286, 288, 290, 292, 

294, 296. 298 over a region of at least about 100 residues, as determined by 
analysis with a sequence comparison algorithm or by visual inspection; and 
polypeptides having at least 10 consecutive amino acids of a polypeptide having a 
sequence selected from the group consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 
10 18, 20, 22, 24, 26, 28, 30, 32, 34. 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 
64. 66, 68, 70, 72, 74, 76. 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98. 100. 102. 104, 
106, 108. 110. 112. 114, 116. 118, 120. 122, 124. 126. 128. 130, 132, 134, 136, 138, 
140, 142. 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166. 168, 170, 172, 
174, 176, 178. 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 
15 210, 212, 214. 216, 218, 220, 222. 224. 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298. 

30. A purified polypeptide selected from the group consisting of: 
20 polypeptides having an amino acid sequence selected fcom the group consisting 
of: SEQ ID NOS: 2. 4, 6, 10, 12, 14. 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62. 64. 66, 68. 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90. 92, 94, 96. 98. 100. 102. 104, 106. 108. 1 10. 1 12, 1 14, 1 16, 1 18, 120, 122, 
124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 
25 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 
192, 194, 196, 198, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 
228, 230, 232, 234, 236, 238, 240. 242, 244. 246, 248, 250, 252, 254, 256, 258, 260, 
262, 264, 266, 268, 270, 272, 274, 276, 278. 280, 282. 284, 286, 288, 290, 292, 294, 
296,298; 

30 variants having at least about 50% homology to at least one of SEQ ID 

NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 
44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 

86, 88, 90. 92, 94, 96, 98. 100, 102, 104, 106, 108. 1 10, 1 12, 1 14, 1 16, 118, 
120, 122, 124, 126, 128, 130, 132, 134. 136, 138, 140, 142, 144, 146, 148, 
35 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172. 174, 176, 178, 

180. 182. 184. 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 
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5 212, 214, 216, 218, 220, 222. 224, 226, 228, 230, 232, 234, 236, 238, 240, 

242, 244, 246, 248, 250, 252, 254. 256, 258, 260, 262, 264, 266. 268. 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298. over a 
region of at least about 100 residues, as determined by analysis with a 
sequence comparison algorithm or by visual inspection; 
10 sequences complementary to SEQ ID NOS: 2. 4, 6, 10, 12, 14, 16, 18, 

20, 22. 24. 26. 28. 30. 32. 34. 36. 38. 40. 42, 44, 46, 48. 50, 52, 54, 56, 58, 60. 62, 64, 
66, 68, 70, 72. 74, 76. 78, 80, 82. 84. 86, 88, 90. 92, 94, 96, 98. 100, 102, 104, 106, 
108, 110, 112, 114, 116, 118, 120, 122, 124, 126. 128. 130, 132, 134. 136. 138, 140. 
142. 144, 146, 148. 150, 152, 154, 156, 158, 160, 162, 164, 166. 168, HO, 172, 174, 
15 176, 178, 180. 182. 184, 186. 188, 190, 192, 194, 196. 198. 202, 204, 206, 208, 210, 
212, 214, 216, 218, 220, 222, 224. 226. 228. 230, 232. 234. 236. 238. 240. 242, 244. 
246. 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280. 282, 284, 286, 288, 290, 292, 294. 296, 298; and 

sequences complementary to variants having at least about 50% 
20 homology to SEQ E) NOS: 2, 4, 6, 10, 12, 14, 16. 18, 20, 22. 24. 26, 28, 30, 

32, 34, 36, 38. 40. 42. 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80, 82. 84. 86. 88. 90, 92. 94, 96. 98, 100, 102, 104, 106, 108, 1 10, 
112, 114, 116. 118, 120. 122, 124, 126. 128, 130, 132, 134, 136, 138, 140. 
142. 144. 146, 148. 150, 152, 154, 156, 158. 160, 162, 164. 166. 168. 170. 
25 172, 174, 176. 178, 180, 182, 184, 186. 188. 190, 192, 194, 196, 198, 202, 

204, 206, 208, 210, 212. 214. 216. 218, 220, 222, 224, 226, 228. 230, 232, 
234, 236. 238, 240, 242, 244. 246, 248, 250, 252. 254. 256, 258, 260, 262. 
264, 266, 268, 270, 272, 274. 276. 278. 280. 282, 284, 286, 288, 290. 292. 
294, 296, 298 over a region of at least about 100 residues, as determined by 
30 analysis with a sequence comparison algorithm or by visual inspection; and 

polypeptides having at least 10 consecutive amino acids of a polypeptide having a 
sequence selected from the group consisting of SEQ ID NOS: 2. 4. 6, 10, 12, 14, 16. 
18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62. 
64, 66, 68, 70, 72. 74, 76, 78, 80, 82, 84, 86, 88. 90, 92, 94, 96, 98. 100. 102. 104. 
35 106, 108, 1 10, 1 12, 1 14, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168. 170. 172. 
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5 174. 176, 178. 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 
210. 212, 214. 216, 218. 220, 222, 224, 226, 228. 230, 232, 234, 236, 238, 240, 242, 
244. 246, 248, 250, 252. 254, 256, 258, 260. 262, 264, 266. 268. 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292. 294, 296, 298. 

10 31. The purified polypeptide as claimed in claim 30, wherein the amino acid sequence 
has at least about 50% homology to a sequence selected fiom the group consisting 
of SEQ ID NOS: 2. 4. 6. 10. 12. 14. 16. 18. 20. 22. 24. 26. 28, 30. 32. 34, 36, 38, 
40. 42, 44, 46. 48. 50, 52. 54. 56. 58, 60. 62. 64, 66, 68, 70. 72, 74, 76. 78, 80, 82, 
84, 86, 88, 90, 92. 94, 96. 98. 100. 102, 104, 106, 108. 1 10, 1 12. 1 14. 1 16. 118. 

15 120. 122. 124. 126, 128, 130, 132. 134, 136. 138. 140. 142. 144, 146, 148, 150, 
152. 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 
184, 186. 188, 190, 192, 194. 196. 198. 202. 204. 206, 208. 210, 212, 214, 216, 
218. 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 
250. 252. 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 

20 282, 284, 286, 288, 290, 292, 294, 296, 298, over a region of at least about 200 
residues. 

32. The purified polypeptide as claimed in claim 30, wherein the amino acid sequence 

has at least about 50% homology to a sequence selected from the group consisting 
25 of SEQ ID NOS: 2. 4. 6. 10, 12, 14. 16, 18, 20, 22, 24. 26, 28. 30. 32, 34, 36, 38, 

40, 42, 44, 46, 48. 50. 52. 54, 56, 58, 60, 62, 64. 66. 68. 70, 72, 74. 76, 78, 80, 82. 

84. 86, 88, 90, 92, 94, 96. 98. 100, 102, 104, 106, 108, 1 10. 1 12. 1 14, 1 16, 1 18. 

120. 122, 124, 126, 128, 130, 132, 134, 136, 138. 140, 142. 144. 146. 148. 150. 

152. 154. 156. 158. 160. 162. 164. 166. 168. 170, 172, 174, 176, 178, 180, 182, 
30 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 212, 214, 216. 

218. 220. 222. 224, 226, 228. 230, 232. 234, 236, 238. 240. 242. 244. 246. 248. 

250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 

282, 284, 286, 288, 290, 292, 294. 296, 298, over die entire sequence. 

35 33. The purified polypeptide as claimed in claim 30, 3 1 or 32, wherein in the amino 
acid sequence has at least about 55% homology to a sequence selected fiom the 
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5 group consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 

30, 32, 34, 36, 38, 40, 42, 44, 46, 48. 50. 52, 54, 56. 58, 60, 62, 64, 66. 68, 70. 72, 
74, 76, 78, 80, 82, 84, 86, 88, 90, 92. 94, 96. 98, 100, 102, 104, 106, 108, 1 10, 
112. 114, 116. 118, 120. 122. 124, 126. 128. 130, 132. 134, 136. 138. 140. 142, 
144. 146, 148, 150, 152, 154. 156, 158, 160, 162, 164, 166, 168. 170. 172. 174. 
10 176. 178, 180, 182. 184, 186, 188, 190, 192. 194. 196, 198, 202, 204, 206, 208. 
210. 212. 214. 216. 218, 220, 222, 224, 226, 228, 230, 232, 234. 236, 238, 240, 
242, 244, 246, 248, 250, 252, 254. 256. 258. 260, 262, 264, 266, 268, 270, 272, 
274, 276, 278, 280, 282, 284. 286, 288, 290, 292, 294, 296, 298. 

15 34. The purified polypeptide as claimed in claim 30, 3 1 or 32, wherein the amino acid 
sequence has at least about 60% homology to a sequence selected from the group 
consisting of SEQ ID NOS: 2, 4. 6, 10, 12, 14, 16, 18, 20. 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 46, 48, 50, 52. 54. 56. 58, 60, 62. 64. 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94. 96, 98. 100, 102, 104, 106, 108, 1 10, 1 12, 1 14. 

20 116. 1 18. 120. 122, 124, 126, 128, 130, 132. 134. 136, 138, 140, 142, 144. 146. 
148. 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170. 172, 174, 176, 178, 
180, 182, 184, 186, 188. 190, 192, 194, 196, 198, 202, 204, 206, 208,210, 212, 
214, 216. 218. 220. 222, 224, 226. 228, 230. 232, 234, 236, 238, 240. 242. 244, 
246. 248. 250, 252. 254. 256. 258, 260, 262, 264. 266. 268. 270, 272, 274. 276, 

25 278. 280. 282. 284. 286. 288, 290, 292, 294. 296. 298. 

35. The purified polypeptide as claimed in claim 30, 3 1 or 32, wherein the amino acid 
sequence has at least about 65% homology to a sequence selected fiom the group 
consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16. 18. 20, 22, 24, 26, 28, 30, 32, 

30 34, 36, 38, 40. 42, 44, 46, 48, 50, 52, 54, 56, 58, 60. 62. 64. 66. 68. 70. 72, 74, 76, 
78, 80. 82. 84, 86, 88, 90, 92, 94, 96, 98, 100. 102, 104, 106. 108, 1 10, 1 12, 1 14, 
116, 118. 120, 122, 124, 126, 128, 130, 132, 134. 136. 138, 140, 142, 144, 146, 
148. 150, 152. 154. 156. 158. 160. 162, 164. 166. 168. 170. 172, 174, 176. 178. 
180. 182, 184, 186. 188. 190, 192, 194. 196, 198, 202, 204, 206, 208. 210, 212, 

35 214, 216, 218. 220. 222. 224, 226. 228, 230. 232, 234, 236, 238, 240, 242, 244, 
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5 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276. 
278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298. 



36. The purified polypeptide as claimed in claim 30, 3 1 or 32, wherein the amino acid 
sequence has at least about 70% homology to a sequence selected from the group 

10 consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34. 36, 38, 40. 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66. 68. 70, 72. 74. 76. 
78, 80, 82, 84. 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106. 108, 1 10. 1 12, 1 14, 
116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136. 138, 140, 142, 144. 146, 
148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168. 170, 172, 174, 176, 178, 

15 180, 182, 184, 186, 188, 190. 192, 194. 196, 198, 202. 204. 206, 208. 210, 212, 
214, 216, 218, 220, 222. 224. 226. 228, 230, 232, 234, 236, 238, 240, 242, 244, 
246, 248, 250, 252. 254. 256, 258, 260. 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292. 294. 296. 298. 



20 37. The purified polypeptide as claimed in claim 30, 3 1 or 32, wherein the amino acid 
sequence has at least about 75% homology to a sequence selected firom the group 
consisting of SEQ ID NOS: 2. 4. 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38. 40, 42, 44, 46, 48, 50, 52, 54. 56. 58. 60. 62. 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94. 96. 98, 100, 102, 104, 106, 108, 110, 112. 114. 

25 116. 1 18, 120, 122, 124, 126, 128. 130, 132. 134. 136, 138, 140, 142, 144. 146, 
148, 150, 152. 154. 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 
180. 182. 184, 186, 188, 190. 192. 194, 196. 198, 202. 204, 206, 208. 210, 212, 
214, 216, 218, 220. 222, 224. 226. 228, 230, 232, 234. 236. 238, 240. 242, 244, 
246. 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274. 276, 

30 278, 280, 282. 284. 286, 288, 290, 292, 294. 296, 298. 



38. The purified polypq>tide as claimed in claim 30, 31 or 32. wherein the amino acid 
sequence has at least about 80% homology to a sequence selected fiom the group 
consisting of SEQ ID NOS: 2. 4, 6. 10, 12, 14, 16. 18, 20, 22, 24, 26, 28, 30. 32, 
35 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56. 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80. 82, 84, 86. 88, 90, 92. 94, 96, 98, 100, 102. 104.*106, 108. 1 10, 112. 114, 
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5 116. 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 

148. 150, 152, 154, 156, 158, 160, 162. 164, 166, 168, 170. 172. 174. 176, 178, 
. 180, 182, 184, 186. 188. 190. 192, 194, 196. 198, 202, 204, 206, 208, 210, 212, 
214, 216, 218. 220, 222, 224. 226, 228. 230, 232, 234, 236. 238, 240, 242, 244, 
246, 248, 250, 252, 254, 256, 258. 260. 262. 264. 266. 268, 270, 272, 274, 276, 
to 278,280,282.284,286,288,290.292,294,296,298. 

39. The purified polypeptide as claimed in claim 30, 3 1 or 32, wherein the amino acid 
sequence has at least about 85% homology to a sequence selected fix)m the group 
consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 

15 34, 36, 38, 40, 42, 44. 46, 48, 50, 52, 54. 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80. 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 
116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142. 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 
180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 212, 

20 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 
246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298. 

40. The purified polypeptide as claimed in claim 30, 3 1 or 32, wherein the amino acid 
25 sequence has at least about 90% homology to a sequence selected fivm the group 

consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14. 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62. 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 
116, 118, 120, 122, 124, 126, 128, 130. 132, 134, 136, 138, 140, 142, 144, 146, 
30 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 
180, 182, 184, 186, 188, 190. 192, 194, 196, 198, 202, 204, 206, 208, 210, 212, 
214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 
246, 248, 250, 252, 254. 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298. 

35 
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5 41. The purified polypeptide as claimed in claim 30, 3 1 or 32, wherein the amino acid 
sequence has at least about 95% homology to a sequence selected fh>m the group 
consisting of SEQ ID NOS: 2. 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 46. 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 

10 1 16, 1 18, 120, 122, 124, 126. 128. 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, ISO, 152, 154, 156, 158. 160, 162. 164. 166. 168, 170, 172, 174, 176, 178, 
180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 212, 
214, 216, 218, 220, 222, 224, 226. 228, 230. 232, 234. 236, 238, 240, 242, 244, 
246, 248, 250, 252, 254, 256, 258, 260. 262, 264, 266, 268, 270. 272, 274, 276, 

15 278. 280, 282. 284, 286, 288, 290, 292. 294, 296, 298. 

42. A purified polypeptide as claimed in claim 30, having an amino acid sequence 
selected firom the group consisting of SEQ ID NOS: 2, 4, 6, 10. 12, 14, 16, 18. 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 

20 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120. 122, 124, 126, 128, 130. 132. 134. 136, 
138. 140, 142, 144. 146, 148. 150. 152. 154, 156, 158, 160, 162, 164, 166. 168, 
170. 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,202, 
204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226. 228, 230, 232. 234, 

25 236. 238, 240, 242. 244, 246, 248. 250, 252. 254. 256, 258, 260. 262, 264, 266, 
268, 270, 272, 274, 276, 278. 280. 282, 284, 286. 288, 290. 292, 294, 296, 298; 
and sequences having at least about 50% homology to at least one of SEQ ID 
NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28. 30, 32, 34, 36, 38, 40. 42, 44. 
46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 

30 90, 92, 94, 96, 98, 100, 102, 104, 106, 108. 1 10. 1 12, 1 14, 1 16. 1 18, 120, 122, 
124, 126. 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182. 184. 186, 
188, 190, 192. 194. 196, 198, 202, 204. 206. 208, 210, 212. 214, 216. 218. 220. 
222, 224, 226. 228. 230, 232, 234, 236, 238, 240, 242, 244. 246. 248, 250, 252, 

35 254, 256, 258, 260. 262. 264. 266, 268. 270. 272. 274, 276, 278, 280, 282. 284, 
286, 288. 290. 292, 294, 296, 298, over the entire sequence. 
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43. A purified antibody that specifically binds to a polypeptide selected from the 
group consisting of: 

polypeptides comprising an amino acid sequence selected from the group 
consisting of: 

10 SEQ ID NOS: 2, 4, 6, 10, 12. 14, 16, 18, 20, 22, 24, 26. 28, 30, 32, 34, 

36. 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 112, 
114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 
144. 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168. 170, 172, 

15 174. 176. 178. 180. 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 

206, 208. 210, 212, 214, 216, 218, 220, 222, 224. 226, 228, 230, 232, 234, 
236, 238. 240. 242. 244. 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 
266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 
296,298; 

20 variants having at least about 50% homology to at least one of SEQ ID 

NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30. 32, 34, 36, 38, 40, 42, 
44, 46, 48, 50, 52, 54, 56, 58, 60, 62. 64. 66, 68, 70, 72. 74. 76, 78, 80, 82, 84, 
86, 88, 90. 92,94, 96, 98, 100, 102, 104, 106, 108. 110, 112, 114, 116, 118, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 
25 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 

180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 
212, 214, 216, 218. 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 
242, 244, 246, 248, 250, 252, 254, 256, 258. 260, 262, 264, 266, 268, 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, over a 
30 region of at least about 100 residues, as determined by analysis with a 

sequence comparison algorithm or by visual inspection; 

sequences complementary to SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 
20, 22, 24, 26, 28. 30, 32, 34. 36. 38. 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78. 80. 82. 84. 86, 88. 90. 92. 94. 96. 98, 100, 102, 104, 106. 
35 108. 110, 112, 114, 116, 118, 120, 122, 124, 126. 128, 130, 132, 134. 136, 138, 140, 
142, 144, 146, 148, 150, 152. 154. 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 
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5 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210. 
212, 214, 216, 218, 220, 222, 224, 226, 228. 230, 232, 234, 236, 238, 240, 242, 244. 
246, 248, 250, 252. 254, 256, 258, 260, 262, 264, 266, 268, 270, 272. 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298; and 

sequences complementary to variants having at least about 50% 
10 homology to SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 

32, 34, 36, 38, 40. 42. 44. 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94. 96, 98, 100, 102, 104, 106, 108, 1 10, 
112, 114, 116, 118, 120, 122, 124. 126, 128, 130, 132, 134, 136, 138, 140, 
142, 144, 146, 148, 150. 152, 154, 156, 158, 160, 162. 164. 166, 168, 170, 
15 172, 174, 176, 178, 180, 182. 184, 186, 188. 190. 192, 194. 196, 198. 202, 

204, 206, 208, 210, 212, 214, 216, 218. 220. 222. 224. 226. 228, 230, 232, 
234, 236, 238. 240, 242, 244. 246, 248, 250. 252, 254, 256, 258, 260. 262. 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 
294, 296, 298 over a region of at least about 100 residues, as determined by 
20 analysis with a sequence comparison algorithm or by visual inspection; and 

polypeptides having at least 10 consecutive amino acids of a polypeptide having a 
sequence selected firom the group consisting of SEQ ID NOS: 2, 4, 6, 10. 12, 14, 16, 
18, 20. 22, 24. 26. 28, 30, 32, 34. 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 
64, 66. 68. 70, 72, 74, 76, 78, 80, 82. 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
25 106, 108, 110. 112, 114, 116, 118, 120, 122. 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146. 148, 150, 152. 154, 156. 158, 160, 162, 164, 166, 168, 170, 172, 
174, 176, 178. 180. 182. 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 
210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248. 250. 252, 254, 256, 258, 260. 262. 264. 266, 268, 270, 272, 274, 276, 
30 278, 280, 282, 284. 286, 288, 290, 292, 294, 296, 298. 

44. A purified antibody as claimed in claim 43, that specifically binds to a polypeptide 
having at least 10 consecutive amino acids of a polypeptide selected from the 
group consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20. 22. 24. 26. 28, 
35 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56. 58, 60, 62, 64, 66, 68, 70, 72, 
74. 76, 78, 80, 82, 84. 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 
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5 112, 114, 116, 118, 120, 122, 124, 126, 128, 130. 132. 134, 136, 138, 140, 142, 
144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166. 168. 170, 172, 174, 
176, 178. 180. 182, 184, 186, 188. 190, 192, 194. 196, 198, 202, 204, 206, 208, 
210, 212. 214. 216. 218. 220, 222, 224, 226, 228. 230. 232, 234, 236, 238, 240, 
242, 244, 246, 248, 250, 252. 254, 256. 258, 260. 262, 264, 266, 268, 270, 272. 
10 274,276,278,280,282.284,286,288.290,292.294,296.298. 

45. The antibody of claim 43, wherein the antibodies are polyclonal. 

46. The antibody of claim 43, wherein the antibodies are monoclonal. 

15 

47. A method of producing a polypeptide selected from the group consisting of: 
polypeptides having an amino acid sequence selected from the group consisting 

of: SEQ K) NOS: 2, 4, 6, 10, 12, 14, 16, 1 8, 20, 22, 24. 26, 28, 30, 32, 34, 36. 38, 

40, 42, 44, 46. 48, 50, 52, 54. 56, 58, 60, 62, 64, 66, 68, 70. 72. 74, 76, 78, 80, 82. 84, 
20 86, 88, 90, 92, 94, 96, 98. 100, 102, 104, 106, 108, 1 10, 1 12, 1 14. 1 16, 1 18, 120, 122, 

124, 126, 128, 130, 132, 134, 136. 138, 140, 142, 144, 146, 148, 150, 152. 154. 156. 

158, 160, 162, 164, 166. 168, 170. 172. 174, 176, 178, 180. 182, 184, 186, 188. 190. 

192. 194. 196, 198, 202, 204. 206, 208, 210, 212. 214. 216, 218. 220, 222, 224, 226. 

228, 230. 232, 234, 236, 238, 240, 242, 244, 246. 248, 250. 252, 254, 256, 258. 260, 
25 262, 264. 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 

296,298; 

variants having at least about 50% homology to at least one of SEQ ID 
NOS: 2, 4, 6, 10. 12. 14, 16, 18, 20. 22, 24, 26. 28, 30, 32, 34, 36, 38, 40, 42, 
44, 46, 48, 50, 52, 54, 56, 58, 60, 62. 64, 66, 68, 70, 72. 74, 76, 78, 80, 82, 84, 

30 86, 88, 90, 92. 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 

120, 122, 124, 126, 128, 130, 132. 134, 136, 138, 140, 142, 144, 146, 148. 
150, 152, 154, 156. 158, 160, 162, 164, 166, 168, 170, 172. 174. 176, 178, 
180, 182, 184. 186. 188, 190, 192, 194, 196, 198. 202. 204. 206, 208. 210, 
212, 214, 216, 218, 220. 222, 224, 226, 228, 230, 232, 234. 236, 238, 240, 

35 242. 244. 246, 248, 250. 252. 254. 256. 258, 260. 262, 264. 266. 268. 270. 

272, 274, 276. 278. 280, 282. 284, 286, 288, 290, 292. 294. 296. 298, over a 
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S region of at least about 1 00 residues, as determined by analysis with a 

sequence comparison algorithm or by visual inspection; 

sequences complementary to SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 
20, 22, 24, 26, 28, 30, 32, 34, 36. 38, 40, 42, 44, 46. 48, 50, 52, 54, 56, 58, 60, 62, 64, 
66; 68, 70, 72. 74, 76, 78, 80, 82. 84, 86. 88, 90, 92. 94. 96, 98, 100, 102, 104, 106. 
10 108. 1 10, 1 12, 1 14, 1 16, 118, 120, 122, 124, 126, 128. 130. 132, 134, ,136, 138, 140, 
142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 
176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,202.204,206.208,210. 
212. 214. 216, 218. 220, 222, 224, 226. 228, 230, 232, 234, 236, 238, 240, 242, 244, 
246, 248, 250, 252. 254, 256. 258, 260, 262. 264. 266. 268, 270. 272, 274,276, 278, 
1 5 280. 282, 284, 286, 288, 290, 292, 294, 296, 298; and 

sequences complementary to variants having at least about 50% 
homology to SEQ ID NOS: 2, 4. 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30. 
32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80. 82, 84. 86, 88, 90. 92, 94, 96, 98, 100. 102. 104. 106, 108, 1 10, 
20 112, 114. 116, 118, 120, 122, 124. 126, 128. 130. 132, 134, 136, 138, 140, 

142, 144, 146, 148, 150, 152. 154. 156. 158. 160. 162, 164. 166, 168, 170, 
172, 174, 176, 178, 180. 182, 184, 186, 188, 190. 192, 194, 196, 198, 202, 
204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230. 232, 
234, 236, 238, 240. 242, 244, 246, 248, 250, 252, 254, 256. 258, 260, 262, 
25 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292. 

294, 296, 298 over a region of at least about 100 residues, as determined by 
analysis with a sequence comparison algorithm or by visual inspection; and 
polypeptides having at least 10 consecutive amino acids of a polypq)tide having a 
sequence selected fiom the group consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 
30 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 
64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92. 94. 96, 98. 100, 102. 104. 
106. 108, 110. 112, 114, 116, 118, 120, 122, 124, 126, 128. 130. 132, 134, 136. 138, 
140, 142. 144, 146, 148, 150, 152, 154, 156, 158. 160, 162. 164, 166, 168. 170, 172, 
174. 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,202, 204,206,208, 
35 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 
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S 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298; comprising the steps of 
introducing a nucleic acid encoding the polypeptide into a host cell under conditions 
that allow expression of the polypeptide, and recovering the polypeptide. 



48. A method of producing a polypeptide comprising at least 10 amino acids of a 

10 ' sequence selected from the group consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38. 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74, 76. 78, 80, 82, 84, 86, 88, 90, 92, 94. 96, 98, 100, 
102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 
134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 

15 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 
198, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 
232, 234, 236, 238, 240, 242, 244, 246, 248, 250. 252. 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282. 284, 286, 288, 290, 292, 294. 
296. 298. comprising the steps of: introducing a nucleic acid encoding the 

20 polypeptide, operably linked to a promoter, into a host cell under conditions that 
allow expression of the polypeptide, and recovering the polypeptide. 

49. A method of generating a variant comprising: 

obtaining a nucleic acid comprising a polynucleotide selected from the group 

25 consisting of: 

SEQ E) NOS: 1, 3, 5, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57. 59, 61, 63, 65, 67, 69, 71, 73, 75, 
77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 
113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 

30 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 

173, 175. 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199,201, 
203. 205. 207, 209. 211, 213. 215, 217. 219. 221. 223, 225, 227, 229, 231, 
233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 
263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287. 289, 291, 

35 293, 295, 297, 299; 
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S variants having at least about 50% homology to at least one of SEQ ID 

NOS: 1, 3. 5. 9, 1 1. 13, 15. 17, 19, 21, 23, 25, 27, 29, 31. 33, 35, 37, 39, 41, 
43, 45 ,47, 49, 51, 53, 55, 57. 59, 61, 63. 65. 67, 69, 71, 73, 75, 77, 79, 81, 83, 
85, 87. 89, 91. 93. 95. 97. 99. 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 
119, 121, 123, 125, 127. 129, 131. 133, 135, 137, 139, 141, 143, 145, 147, 
10 149. 151, 153. 155. 157. 159. 161, 163, 165, 167, 169. 171. 173, 175, 177, 

179, 181, 183, 185, 187, 189, 191. 193. 195, 197, 199, 201, 203, 205, 207, 
209, 21 1, 213, 215, 217, 219, 221, 223. 225. 227. 229, 231. 233, 235, 237, 
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261. 263, 265. 267, 
269, 271, 273, 275, 277. 279. 281. 283, 285. 287, 289, 291, 293, 295, 297, 299 
15 over a region of at least about 100 residues, as determined by analysis with a' 

sequence comparison algorithm or by visusd inspection; 

sequences complementary to SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17, 
19, 21, 23. 25. 27, 29, 31, 33, 35, 37. 39. 41, 43, 45 ,47, 49. 51, 53, 55, 57, 59, 61, 63. 
65. 67, 69, 71, 73, 75, 77, 79, 81. 83. 85, 87. 89, 91. 93, 95. 97. 99, 101. 103. 105. 
20 107, 109, 111, 113, 115, 117. 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147, 149, 151, 153, 155, 157. 159. 161, 163, 165, 167, 169, 171, 173, 
175, 177, 179. 181. 183, 185, 187, 189, 191, 193, 195, 197, 199. 201,203,205,207, 
209, 21 1, 213. 215, 217, 219, 221, 223, 225. 227. 229, 231, 233, 235, 237, 239, 241, 
243, 245. 247, 249, 251, 253, 255, 257, 259, 261, 263. 265, 267, 269. 271, 273, 275, 
25 277, 279, 28 1, 283, 285, 287, 289, 291 , 293, 295. 297. 299; and 

sequences complementary to variants having at least about 50% 
homology to SEQ ID NOS: 1.3.5.9. 11, 13, 15, 17. 19,21,23.25.27,29,31. 
33. 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63. 65, 67, 69. 71, 73, 
75. 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101. 103, 105, 107. 109, 111, 
30 113, 115, 117, 119. 121. 123. 125. 127, 129, 131, 133, 135, 137, 139, 141, 

143, 145, 147. 149, 151, 153, 155, 157. 159, 161, 163, 165. 167, 169, 171, 
173, 175. 177. 179. 181. 183, 185, 187. 189, 191, 193, 195, 197, 199,201. 
203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 
233, 235, 237, 239, 241, 243, 245. 247, 249, 251, 253. 255, 257, 259, 261, 
35 263. 265, 267. 269. 271. 273. 275. 277. 279. 281, 283, 285, 287, 289, 291, 
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5 293, 295, 297, 299 over a region of at least about 100 residues, as determined 

by analysis with a sequence comparison algorithm or by visual inspection; and 

fragments comprising at least 30 consecutive nucleotides any of the 
foregoing sequences; and 

modifying one or more nucleotides in said polynucleotide to another 
10 nucleotide, deleting one or more nucleotides in said polynucleotide, or adding one 
or more nucleotides to said polynucleotide. 

50. The method of claim 49, wherein the modifications are introduced by a method 
selected from the group consisting of: error-prone PCR, shuffling, 
IS oligonucleotide-directed mutagenesis, assembly PCR, sexual PCR mutagenesis, in 
vivo mutagenesis, cassette mutagenesis, recursive ensemble mutagenesis, 
exponential ensemble mutagenesis, site-specific mutagenesis, gene reassembly, 
gene site saturated mutagenesis and any combination of these methods. 

20 51. The method of claim 50, wherein the modifications are introduced by error-prone 
PCR. 

52. The method of claim 50, wherein the modifications are introduced by shuffling. 

25 53. The method of claim 50, wherein the modifications are introduced by 
oligonucleotide-directed mutagenesis. 

54. The method of claim 50, wherein the modifications are introduced by assembly 
PCR. 

30 

55. The method of claim 50, wherein the modifications are introduced by sexual PCR 
mutagenesis. 

56. The method of claim 50, wherein the modifications are introduced by in vivo 
35 mutagenesis. 
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S S7. The method of claim SO, wherein the modifications are introduced by cassette 
mutagenesis. 

58. The method of claim SO, wherein the modifications are- introduced by recursive 
ensemble mutagenesis. 

10 

59. The method of claim SO, wherein the modifications are introduced by exponential 
ensemble mutagenesis. 

60. The method of claim SO, wherein the modifications are introduced by site-specific 
IS mutagenesis. 

61. The method of claim SO, wherein the modifications are introduced by gene 
reassembly. 

20 62. The method of claim SO, wherein the modifications are introduced by gene site 
saturated mutagenesis. 

63. A computer readable medium having stored thereon a sequence selected from the 
group consisting of: 

25 nucleic acid sequences of SEQ ID NOS: 1, 3, S, 9, 11, 13, IS, 17, 19, 

21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 
63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 
103, 105, 107, 109, 111, 113, 115, 117, 119, 121. 123, 125, 127. 129, 131, 
133, 135, 137, 139, 141, 143, 145, 147, 149, 151. 153, 155, 157. 159. 161, 

30 163, 165. 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 

193, 195, 197, 199, 201, 203, 205. 207, 209, 211. 213, 215, 217. 219, 221. 
223, 225, 227, 229, 231, 233, 235, 237, 239, 241. 243, 245, 247, 249, 251, 
253, 255, 257. 259, 261, 263, 265. 267, 269, 271. 273. 275. 277, 279, 281, 
283. 285, 287, 289, 291, 293, 295, 297, 299; 

35 variants of a nucleic acid sequence having at least about 50% 

homology to at least one of SEQ ID NOS: 1,3,5,9, 11, 13, 15, 17, 19,21,23, 
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5 25, 27, 29, 31, 33. 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 

67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 
107, 109, 111, 113. 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141. 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 
167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 
10 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 

227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255. 
257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 
287, 289, 29 1, 293, 295, 297, 299, over a region of at least about 100 residues, 
as determined by analysis with a sequence comparison algorithm or by visual 
15 inspection; 

nucleic acid sequences complementary to SEQ ID NOS: 1, 3, 5, 9, 1 1, 
13, 15, 17, 19, 21. 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 
59, 61. 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 
103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125. 127, 129, 131, 133, 135, 
20 137, 139, 141, 143, 145, 147, 149, 151. 153, 155, 157, 159, 161, 163, 165, 167, 169, 
171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195. 197. 199. 201. 203. 
205. 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233. 235, 237, 
239, 241, 243, 245, 247, 249. 251. 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299; 
25 nucleic acid sequences complementary to variants of nucleic acid 

sequences having at least about 50% homology to SEQ ID NOS: 1, 3, 5, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 
55, 57, 59, 61, 63, 65, 67. 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 
97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
30 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 

159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 
189, 191. 193. 195, 197, 199,201,203,205,207.209.211.213,215,217, 
219, 221, 223, 225, 227, 229, 231, 233. 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253. 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 
35 279, 28 1 , 283, 285, 287, 289, 29 1 , 293, 295, 297, 299over a region of at least 
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5 about 1 00 residues, as determined by analysis with a sequence comparison 

algorithm or by visual inspection; 

polypeptide sequences SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 
66, 68, 70, 72. 74, 76. 78, 80, 82, 84, 86. 88, 90, 92, 94, 96, 98, 100, 102. 104, 
10 106. 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134. 

136, 138, 140. 142, 144, 146, 148. 150, 152. 154, 156, 158, 160, 162. 164. 
166. 168, 170. 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 202, 204, 206, 208, 210, 212, 214, 216, 218. 220, 222. 224, 226. 
228, 230, 232. 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
15 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278. 280, 282, 284, 286, 

288, 290, 292, 294, 296, 298; 

variants of polypeptide sequences having at least about 50% homology 
to at least one of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64. 66. 68, 70, 
20 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94. 96. 98. 100, 102, 104, 106, 108, 

110, 112, 114, 116, 118, 120, 122, 124, 126. 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 
170, 172, 174, 176, 178, 180, 182, 184, 186. 188, 190, 192, 194, 196, 198, 
202, 204, 206, 208, 210, 212, 214, 216, 218, 220. 222. 224. 226, 228. 230, 
25 232, 234, 236, 238, 240, 242, 244, 246, 248, 250. 252, 254, 256, 258, 260, 

262, 264, 266, 268, 270, 272, 274, 276, 278, 280. 282, 284, 286, 288, 290, 
292, 294, 296, 298, over a region of at least about 100 residues, as determined 
by analysis with a sequence comparison algorithm or by visual inspection; 

polypeptide sequences complementary to SEQ ID NOS: 2, 4, 6, 10, 12, 
30 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 
138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 
35 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 
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5 242, 244, 246. 248, 250, 252, 254, 256, 258. 260. 262. 264, 266. 268, 270, 272, 274, 
276, 278, 280. 282, 284. 286, 288, 290. 292, 294, 296. 298; and 

polypeptide sequences complementary to variants of polypeptide 
sequences having at least about 50% homology to SEQ ID NOS: 2, 4, 6, 10, 
12, 14, 16, 18, 20, 22. 24, 26, 28, 30, 32. 34. 36. 38, 40, 42, 44. 46. 48. 50. 52, 

10 54. 56, 58. 60, 62. 64, 66. 68, 70. 72. 74. 76. 78. 80, 82, 84, 86, 88, 90. 92. 94. 

96.98, 100, 102, 104, 106, 108, 110, 112. 114, 116, 118. 120. 122, 124, 126, 
128, 130. 132. 134, 136. 138, 140. 142, 144, 146, 148, 150. 152. 154, 156, . 
158, 160, 162. 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192. 194, 196. 198. 202. 204. 206, 208. 210, 212. 214, 216, 218. 

15 220, 222, 224, 226, 228, 230, 232, 234. 236, 238. 240, 242. 244, 246, 248, 

250, 252, 254, 256, 258, 260, 262, 264, 266, 268. 270. 272. 274, 276, 278. 
280, 282, 284, 286. 288, 290. 292, 294, 296. 298 over a region of at least 
about 100 residues, as determined by analysis with a sequence comparison 
algorithm or by visual inspection. 

20 

64. A computer system comprising a processor and a data storage device wherein said 
data storage device has stored thereon a sequence selected from the group 
consisting of: 

nucleic acid sequences of SEQ ID NOS: 1, 3, 5. 9, 1 1. 13. 15. 17, 19, 
25 21, 23. 25, 27, 29, 31, 33, 35. 37, 39, 41, 43, 45 .47. 49, 51. 53. 55, 57, 59, 61. 

63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91. 93, 95. 97. 99. 101, 
103, 105, 107, 109, 111. 113. 115. 117, 119, 121, 123, 125, 127, 129, 131, 133, 
135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157. 159. 161, 163, 
165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189. 191, 193, 
30 195, 197, 199,201,203,205,207,209.211.213,215.217,219,221,223, 

225, 227, 229. 231, 233. 235. 237. 239, 241. 243, 245. 247, 249, 251, 253, 
255, 257. 259. 261. 263, 265, 267. 269. 271, 273. 275, 277, 279, 281, 283, 
285,287,289,291,293.295.297,299; 

variants of a nucleic acid sequence having at least about 50% 
35 homology to at least one of SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15. 17, 19, 21. 23, 

25. 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53. 55. 57. 59. 61. 63. 65. 
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5 67, 69. 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105. 

107. 109. 111. 113, 115. 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137. 139. 141, 143. 145, 147, 149, 151, 153, 155, 157, 159, 161, 163. 165, 
167, 169. 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 
197, 199, 201, 203. 205. 207, 209. 211, 213. 215, 217, 219, 221, 223, 225, 
10 227. 229. 231. 233. 235, 237. 239, 241, 243, 245, 247, 249, 251, 253, 255, 

257, 259. 261, 263, 265, 267, 269, 271, 273, 275. 277, 279. 281. 283. 285, 
287, 289, 291, 293, 295, 297, 299, over a region of at least about 100 residues, 
as determined by analysis with a sequence comparison algorithm or by visual 
inspection; 

15 nucleic acid sequences complementary to SEQ ID NOS; 1, 3, 5, 9, 1 1, 

13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37. 39, 41, 43, 45 .47, 49, 51, 53, 55, 57, 
59, 61. 63, 65. 67. 69, 71, 73, 75. 77, 79, 81. 83, 85. 87, 89, 91, 93, 95, 97, 99, 101, 
103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 
20 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201,203, 
205, 207, 209, 21 1, 213, 215. 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275, 277. 279. 281, 283, 285, 287, 289, 291, 293, 295, 297, 299; 

nucleic acid sequences complementary to variants of nucleic acid 
25 sequences having at least about 50% homology to SEQ ID NOS: 1 , 3, 5, 9, 1 1 . 

13. 15. 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53. 
55, 57, 59. 61, 63, 65, 67, 69, 71, 73, 75, 77, 79. 81, 83. 85, 87, 89, 91, 93, 95, 
97,99, 101, 103. 105, 107, 109, 111, 113. 115, 117, 119, 121. 123. 125. 127, 
129, 131, 133, 135, 137, 139. 141, 143, 145, 147. 149, 151, 153, 155, 157, 
30 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179. 181, 183, 185. 187, 

189, 191, 193. 195, 197, 199. 201, 203, 205, 207, 209, 211. 213, 215, 217, 
219, 221, 223, 225, 227, 229, 231, 233. 235. 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269. 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299 over a region of at least 
35 about 100 residues, as determined by analysis with a sequence comparison 

algorithm or by visual inspection; 
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5 polypeptide sequences SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 

24, 26. 28, 30. 32. 34. 36. 38. 40, 42. 44, 46. 48, 50. 52, 54, 56, 58, 60, 62, 64, 
66, 68. 70. 72. 74. 76. 78. 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106. 108, 110, 112, 114. 116. 118. 120, 122, 124. 126, 128, 130, 132, 134, 
136, 138. 140. 142. 144. 146. 148. 150. 152, 154. 156. 158. 160. 162, 164, 
10 166, 168. 170. 172, 174. 176. 178, 180, 182, 184, 186, 188, 190, 192, 194, 

196, 198, 202, 204, 206, 208. 210, 212, 214. 216, 218, 220, 222, 224, 226. 
228, 230. 232, 234, 236, 238. 240. 242, 244. 246. 248. 250. 252. 254. 256, 
258, 260. 262. 264, 266, 268, 270. 272, 274, 276. 278. 280. 282, 284, 286. 
288.290.292.294.296,298: 
1 5 variants of polypeptide sequences having at least about 50% homology 

to at least one of SEQ ID NOS: 2, 4, 6, 10, 12, 14. 16. 18. 20, 22. 24, 26. 28. 
30. 32, 34, 36, 38, 40, 42, 44. 46, 48, 50, 52, 54, 56, 58, 60, 62, 64. 66. 68. 70. 
72. 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 
110, 112. 1 14. 116. 118, 120, 122. 124, 126, 128, 130, 132, 134, 136, 138, 
20 140. 142, 144. 146. 148. 150. 152, 154, 156, 158, 160, 162, 164, 166, 168, 

170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 
202, 204, 206, 208. 210, 212, 214, 216, 218, 220, 222, 224. 226. 228, 230, 
232. 234, 236, 238, 240, 242. 244, 246, 248, 250, 252. 254, 256, 258, 260, 
262, 264. 266. 268. 270, 272, 274. 276, 278, 280, 282, 284, 286. 288. 290, 
25 292. 294, 296. 298. over a region of at least about 100 residues, as detennined 

by analysis with a sequence comparison algorithm or by visual inspection; 

polypeptide sequences complementary to SEQ ID NOS: 2. 4. 6. 10, 12. 
14, 16, 18, 20. 22. 24, 26. 28, 30. 32. 34, 36. 38. 40, 42. 44, 46, 48, 50, 52, 54, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74. 76. 78. 80. 82. 84. 86. 88. 90. 92. 94. 96. 98. 100. 102, 
30 104, 106. 108, 110, 1 12. 1 14. 1 16, 118, 120, 122, 124, 126, 128. 130. 132. 134, 136, 
138, 140. 142, 144, 146, 148, 150, 152, 154, 156. 158. 160. 162, 164, 166. 168. 170, 
172, 174, 176, 178, 180, 182, 184, 186, 188. 190, 192, 194, 196, 198, 202, 204, 206. 
208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 
242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262. 264, 266, 268, 270, 272. 274. 
35 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298; and 
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S polypeptide sequences complementary to variants of polypeptide 

sequences having at least about 50% homology to SEQ K) NOS: 2, 4, 6, 10, 
12, 14, 16, 18, 20. 22, 24, 26, 28, 30, 32, 34, 36, 38, 40. 42, 44, 46, 48, 50, 52, 
54, 56, 58, 60, 62. 64, 66, 68, 70, 72, 74, 76, 78, 80. 82, 84, 86, 88, 90, 92. 94. 
96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 118, 120, 122, 124, 126, 

10 128, 130, 132, 134. 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156. 

158, 160. 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 202, 204. 206, 208, 210, 212, 214, 216, 218, 
220. 222, 224, 226, 228, 230, 232, 234, 236. 238, 240, 242, 244, 246, 248, 
250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 

15 280, 282, 284, 286, 288, 290, 292, 294, 296, 298 over a region of at least 

about 100 residues, as determined by analysis with a sequence comparison 
algorithm or by visual inspection. 

65. The computer system of claim 64, further comprising a sequence comparison 
20 algorithm and a data storage device having at least one reference sequence stored 
thereon. 



66. The computer system of claim 65, wherein the sequence comparison algorithm 
comprises a computer program which indicates polymorphisms. 

25 

67. The computer system of claim 64, further comprising an identifier which identifies 
one or more features in said sequence. 

68. A method for comparing a first sequence to a second sequence comprising the steps 
30 of: 

reading the first sequence and the second sequence through use of a computer 
program which compares sequences; and 

determining differences between the first sequence and the second sequence with 
the computer program, wherein said first sequence is a sequence selected from the 
35 group consisting of: 
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5 nucleic acid sequences of SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15. 17, 19, 

21, 23, 25, 27, 29, 31, 33, 35, 37. 39, 41, 43, 45 .47, 49, 51, 53. 55. 57. 59. 61. 
63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 
103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 
133, 135, 137, 139, 141, 143, 145. 147, 149, 151, 153, 155, 157, 159, 161, 

10 163, 165, 167, 169, 171. 173. 175. 177. 179, 181. 183. 185, 187, 189, 191, 

193, 195, 197, 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 
223. 225. 227. 229. 231, 233, 235, 237. 239. 241, 243, 245, 247, 249, 251. 
253, 255. 257, 259. 261, 263. 265. 267. 269. 271. 273, 275, 277, 279, 281, 
283, 285, 287, 289, 291. 293, 295, 297, 299; 

1 5 variants of a nucleic acid sequence having at least about 50% 

homology to at least one of SEQ ID NOS: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 
25, 27i 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65. 
67, 69, 71. 73, 75. 77. 79, 81, 83. 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 
107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 

20 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161. 163, 165, 

167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 
197, 199, 201, 203, 205, 207, 209, 211. 213. 215. 217. 219. 221, 223, 225, 
227, 229, 231, 233, 235, 237, 239, 241, 243, 245. 247. 249, 251, 253, 255. 
257, 259, 261, 263, 265, 267. 269. 271, 273, 275, 277, 279. 281. 283, 285, 

25 287, 289, 291, 293, 295, 297, 299, over a region of at least about 100 residues, 

as determined by analysis with a sequoice comparison algorithm or by visual 
inspection; 

nucleic acid sequences complementary to SEQ ID NOS: 1, 3, 5. 9. 1 1, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41. 43, 45 ,47, 49, 51, 53. 55, 57, 

30 59, 61. 63, 65. 67. 69. 71. 73. 75. 77. 79, 81. 83. 85. 87. 89, 91, 93. 95, 97, 99. 101. 
103, 105. 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137. 139, 141, 143, 145. 147. 149. 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 
171. 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199.201,203, 
205, 207. 209, 21 1, 213, 215, 217. 219, 221. 223, 225, 227, 229, 231, 233, 235, 237, 

35 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299; 
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5 nucleic acid sequences complementary to variants of nucleic acid 

sequences having at least about 50% homology to SEQ ID NOS: 1, 3, 5, 9, 1 1, 

13. 15, 17, 19, 21, 23, 25, 27, 29. 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 

55, 57, 59, 61, 63. 65, 67. 69, 71, 73. 75, 77. 79, 81, 83, 85. 87, 89. 91, 93. 95. 

97.99. 101, 103. 105, 107. 109, 111. 113. 115. 117. 119, 121, 123. 125, 127, 
10 129. 131. 133. 135. 137. 139. 141. 143. 145. 147, 149. 151. 153, 155, 157. 

159. 161. 163, 165, 167, 169, 171. 173. 175, 177. 179, 181, 183, 185, 187, 

189, 191, 193, 195, 197. 199, 201, 203, 205, 207, 209, 211, 213. 215. 217. 

219. 221, 223. 225, 227, 229, 231, 233, 235, 237, 239, 241. 243, 245. 247, 

249, 251, 253. 255, 257. 259. 261. 263. 265. 267. 269. 271. 273. 275, 277, 
15 279, 281, 283. 285. 287. 289, 291, 293, 295, 297, 299 over a region of at least 

about 100 residues, as determined by analysis with a sequence comparison 

algorithm or by visual inspection; 

polypeptide sequences SEQ ID NOS: 2, 4, 6, 10, 12. 14. 16, 18, 20, 22, 

24, 26, 28, 30, 32, 34, 36, 38. 40. 42. 44, 46, 48. 50. 52. 54, 56, 58, 60, 62, 64, 
20 66, 68. 70. 72. 74, 76, 78, 80. 82, 84, 86, 88, 90, 92, 94, 96, 98, 100. 102. 104. 

106. 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132. 134. 

136. 138. 140. 142, 144, 146, 148. 150, 152. 154, 156, 158, 160, 162, 164, 

166, 168, 170, 172. 174, 176, 178, 180, 182. 184. 186. 188, 190, 192, 194, 

196, 198, 202. 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 
25 228, 230, 232, 234, 236, 238. 240. 242. 244. 246. 248, 250, 252, 254. 256, 

258, 260, 262, 264, 266, 268. 270. 272, 274, 276, 278, 280, 282, 284, 286, 

288,290,292,294,296,298; 

variants of polypeptide sequences having at least about 50% homology 

to at least one of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, 
30 30, 32, 34, 36, 38, 40. 42. 44. 46. 48. 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 

72, 74. 76. 78. 80. 82, 84, 86. 88. 90. 92. 94. 96. 98. 100. 102. 104. 106. 108. 

110, 112, 114, 116, 118, 120, 122, 124. 126, 128, 130, 132, 134, 136, 138. 

140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166. 168. 

170. 172. 174, 176, 178, 180, 182, 184. 186, 188, 190, 192, 194. 196. 198. 
35 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230. 

232. 234, 236, 238, 240, 242, 244. 246. 248, 250, 252, 254, 256, 258, 260, 
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5 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 

292, 294, 296, 298, over a region of at least about 100 residues, as determined 
by analysis with a sequence comparison algorithm or by visual inspection; 

polypeptide sequences complementary to SEQ ID NOS: 2, 4, 6, 10, 12, 
14, 16, 18, 20; 22. 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 
10 60, 62, 64, 66, 68, 70, 72. 74. 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102. 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130. 132, 134, 136. 
138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158. 160. 162, 164, 166, 168, 170, 
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 
208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 
15 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 
276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298; and 

polypq)tide sequences complementary to variants of polypeptide 
sequences having at least about 50% homology to SEQ ID NOS: 2, 4, 6, 1 0, 
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 
20 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90. 92, 94, 

96,98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122. 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 
158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 212, 214, 216, 218, 
25 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 

250, 252, 254. 256, 258, 260. 262, 264, 266, 268, 270, 272. 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298 over a region of at least 
about 100 residues, as determined by analysis with a sequence comparison 
algorithm or by visual inspection. 

30 

69. The mediod of claim 68, wherem tiie step of detennining differences between the 
first sequence and the second sequence fiirther comprises the stq> of identifying 
polymorphisms. 



35 



70. A method for identifying a feature in a sequence comprising the steps of: 
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5 reading the sequence using a computer program which identifies one or more 
features in a sequence; and 

identifying one or more features in the sequence with the computer program, 
wherein the sequence is selected from the group consisting of: 

nucleic acid sequences of SEQ ID NOS: 1, 3, 5, 9, 11, 13, 15, 17, 19, 

10 21. 23. 25, 27, 29, 31, 33, 35. 37, 39. 41, 43. 45 .47. 49. 51. 53. 55, 57, 59, 61. 

63, 65, 67, 69, 71, 73, 75, 77. 79, 81. 83. 85, 87, 89, 91, 93, 95, 97, 99. 101. 
103, 105. 107. 109. Ill, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131. 
133, 135, 137. 139. 141, 143, 145, 147, 149, 151, 153, 155. 157. 159. 161. 
163, 165. 167. 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189. 191, 

15 193, 195, 197. 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 217. 219, 221, 

223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245. 247, 249. 251, 
253, 255, 257, 259, 261, 263, 265, 267, 269, 271. 273. 275. 277, 279, 281. 
283, 285, 287, 289, 291, 293, 295, 297, 299; 

variants of a nucleic acid sequence having at least about 50% 

20 homology to at least one ofSEQ ID NOS: 1,3,5,9, 11, 13,15, 17,19,21,23, 

25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 
107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 

25 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 

197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219. 221, 223, 225, 
227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 
257, 259, 261, 263. 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 
287, 289, 291, 293, 295, 297, 299, over a region of at least about 100 residues, 

30 as determined by analysis with a sequence comparison algorithm or by visual 

inspection; 

nucleic acid sequences complementary to SEQ ID NOS: 1, 3, 5, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31. 33, 35. 37. 39. 41, 43, 45 ,47, 49, 51, 53, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81. 83, 85, 87. 89. 91, 93, 95, 97, 99, 101, 
35 103, 105, 107, 109, 111, 113, 115, 117, 119. 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 
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5 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 
205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233. 235, 237, 
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267. 269. 271, 
273, 275, 277, 279, 281, 283, 285, 287. 289, 291, 293. 295. 297, 299; 

nucleic acid sequences complementary to variants of nucleic acid 
1 0 sequences having at least about 50% homology to SEQ ID NOS: 1 , 3, 5, 9, 11 , 

13, 15, 17. 19, 21, 23, 25, 27. 29, 31. 33. 35. 37, 39, 41, 43. 45 ,47, 49, 51, 53, 
55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83. 85. 87. 89, 91. 93, 95, 
97,99, 101, 103, 105, 107. 109. 111. 113. 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 
15 159. 161, 163, 165, 167. 169. 171, 173. 175. 177. 179, 181, 183, 185, 187, 

189, 191, 193, 195, 197, 199, 201. 203. 205. 207. 209, 211. 213, 215, 217, 
219, 221, 223, 225, 227, 229, 23 1, 233. 235, 237, 239. 241. 243. 245. 247. 
249, 251, 253. 255. 257. 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289. 291. 293, 295. 297, 299 over a region of at least 
20 about 100 residues, as determined by analysis with a sequence comparison 

algorithm or by visual inspection; 

polypeptide sequences SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20. 22, 
24. 26, 28. 30, 32, 34, 36. 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92. 94. 96. 98, 100, 102, 104, 
25 106, 108, 1 10, 1 12, 1 14. 1 16. 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 

136, 138, 140, 142, 144, 146, 148, 150. 152, 154, 156, 158, 160, 162, 164, 
166, 168, 170, 172, 174, 176, 178, 180. 182. 184. 186. 188. 190, 192, 194, 
196, 198. 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 
228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
30 258, 260, 262, 264, 266, 268, 270, 272, 274. 276, 278, 280, 282, 284, 286, 

288,290,292,294.296,298; 

variants of polypeptide sequences having at least about 50% homology 
to at least one of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52. 54, 56, 58, 60, 62, 64, 66, 68, 70, 
35 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108. 

110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138. 
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5 140, 142, 144, 146, 148, 150, 152, 154, 156. 158, 160, 162, 164, 166. 168, 

170, 172. 174, 176, 178, 180, 182, 184, 186. 188. 190. 192, 194, 196, 198. 
202. 204, 206. 208. 210, 212, 214. 216, 218, 220, 222, 224, 226. 228. 230. 
232, 234. 236, 238, 240. 242. 244, 246. 248. 250. 252. 254, 256, 258, 260, 
262, 264, 266, 268, 270, 272, 274, 276, 278, 280. 282. 284. 286, 288. 290, 
10 292, 294. 296. 298. over a region of at least about 100 residues, as determined 

by analysis with a sequence comparison algorithm or by visual inspection; 

polypeptide sequences complementary to SEQ ID NOS: 2, 4, 6, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58. 
60, 62, 64, 66, 68, 70, 72, 74. 76. 78, 80, 82, 84, 86, 88, 90, 92, 94. 96, 98, 100, 102. 
15 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128. 130, 132, 134. 136, 
138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170. 
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202,204,206, 
208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238. 240, 
242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 
20 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298; and 

polypeptide sequences complementary to variants of polypeptide 
sequences having at least about 50% homology to SEQ ID NOS: 2, 4, 6, 10, 
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 
54, 56, 58, 60, 62, 64. 66. 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 
25 96,98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 

128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 
158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 212, 214, 216, 218, 
220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 
30 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276. 278, 

280, 282, 284, 286, 288, 290, 292, 294, 296. 298 over a region of at least 
about 100 residues, as determined by analysis with a sequence comparison 
algorithm or by visual inspection. 

35 7 1 . A method of hydrolyzing a starch linkage comprising contacting a substance 
containing the starch with a polypeptide selected from the group consisting of 
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5 SEQ ID NOS: 2, 4, 6, 10, 1 2, 14, 1 6, 1 8, 20, 22. 24, 26, 28, 30, 32, 34, 36, 38, 40, 
42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88. 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 
122, 124, 126, 128, 130, 132, 134, 136, 138. 140, 142. 144, 146, 148,. 150, 152, 
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 

10 186. 188, 190, 192, 194, 196. 198, 202, 204, 206, 208, 210, 212. 214. 216. 218. 
220. 222. 224. 226. 228, 230, 232, 234, 236. 238, 240. 242, 244, 246. 248. 250, 
252, 254, 256, 258, 260, 262, 264. 266, 268, 270, 272, 274, 276, 278, 280, 282, 
284, 286, 288, 290, 292, 294, 296, 298, and sequences substantially identical 
thereto, under conditions which fecilitate the hydrolysis of the carbon-halogen 

15 linkage. 



20 

72. A mediod of catalyzing the breakdown of a starch, comprising the stq> of 
contacting a sample containing starch with a polypeptide having a sequence 
selected from the group consisting of: 

polypeptide sequences SEQ ID NOS: 2, 4, 6, 10, 12. 14, 16, 18, 20. 22, 
25 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46. 48, 50, 52, 54, 56, 58. 60, 62, 64, 

66. 68, 70. 72. 74. 76, 78. 80, 82, 84, 86, 88, 90, 92, 94. 96. 98. 100, 102, 104. 
106. 108, 110, 112. 114, 116, 118, 120, 122, 124. 126. 128. 130. 132, 134, 
136, 138, 140, 142, 144. 146. 148. 150, 152, 154, 156, 158, 160, 162, 164, 
166, 168, 170. 172, 174. 176, 178. 180, 182. 184, 186, 188, 190, 192, 194, 
30 196, 198, 202. 204. 206, 208, 210, 212, 214, 216, 218. 220, 222, 224, 226, 

228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254. 256, 
258, 260, 262. 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 
288,290,292,294.296,298; 

variants of polypeptide sequences having at least about 50% homology 
35 to at least one of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28. 

30. 32. 34. 36. 38. 40. 42. 44. 46. 48. 50, 52, 54, 56, 58, 60, 62, 64. 66. 68, 70, 
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5 72, 74, 76, 78, 80, 82, 84. 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 

110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138. 
140, 142, 144, 146, 148, 150. 152. 154, 156, 158. 160, 162, 164, 166, 168, 
170, 172. 174. 176, 178, 180, 182. 184. 186. 188. 190. 192. 194, 196, 198, 
202, 204, 206. 208. 210. 212. 214. 216. 218, 220, 222, 224, 226, 228. 230, 
10 232, 234, 236, 238. 240. 242. 244. 246. 248, 250, 252, 254, 256, 258, 260, 

262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284. 286. 288. 290. 
292, 294, 296, 298, over a region of at least about 100 residues, as deteimined 
by analysis with a sequence comparison algorithm or by visual inspection; 

polypeptide sequences complementary to SEQ ID NOS: 2, 4, 6. 10, 12, 
15 14, 16, 18. 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42. 44. 46, 48, 50, 52, 54, 56, 58, 
60, 62. 64. 66, 68, 70. 72, 74. 76. 78, 80. 82, 84, 86, 88, 90, 92, 94. 96, 98. 100, 102, 
104. 106. 108, 110, 112, 114, 116, 118, 120, 122, 124, 126. 128. 130, 132, 134, 136, 
138, 140, 142, 144, 146, 148, 150, 152. 154, 156, 158, 160, 162, 164, 166, 168, 170. 
172, 174. 176, 178. 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204. 206, 
20 208, 210, 212, 214, 216, 218. 220. 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 
242, 244, 246. 248. 250. 252. 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 
276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298; and 

polypeptide sequences complementary to variants of polypeptide 
sequences having at least about 50% homology to SEQ ID NOS: 2, 4, 6, 10, 
25 12, 14, 16, 18, 20, 22, 24, 26. 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 

54, 56. 58. 60. 62. 64. 66, 68, 70, 72, 74, 76, 78, 80, 82. 84. 86. 88, 90, 92, 94, 
96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128. 130, 132. 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 
158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
30 188, 190. 192, 194, 196, 198, 202, 204, 206. 208, 210, 212, 214,216, 218, 

220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 
250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286. 288, 290, 292, 294, 296. 298 over a region of at least 
about 100 residues, as determined by analysis with a sequence comparison 
35 algorithm or by visual inspection; 
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S under conditions which facilitate the breakdown of the haloalkane or 
halocatboxylic acid. 

73. An assay for identifying functional polypeptide fragments or variants encoded by 
fiagments of SEQ IDNOS: 1, 3, 5, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 

10 35, 37. 39, 41. 43. 45 ,47. 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69. 71, 73, 75, 77, 
79. 81, 83. 85, 87. 89, 91, 93, 95. 97, 99. 101. 103, 105, 107, 109, 11 1, 113, 115, 
117, 119, 121, 123, 125, 127. 129, 131, 133, 135, 137, 139, 141. 143, 145, 147. 
149, 151. 153, 155. 157. 159, 161, 163, 165. 167, 169. 171. 173. 175, 177. 179, 
181. 183, 185, 187, 189, 191, 193. 195, 197, 199, 201, 203, 205, 207, 209, 21 1. 

15 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243. 
245, 247. 249, 251, 253. 255, 257, 259, 261, 263, 265. 267. 269, 271, 273, 275, 
277, 279, 281. 283, 285, 287, 289, 291, 293, 295, 297, 299, and sequences having 
at least about 50% homology to SEQ ID NOS: 1, 3. 5, 9, 1 1. 13. 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39. 41, 43, 45 ,47, 49, 51, 53, 55, 57. 59. 61, 63, 65, 

20 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99. 101, 103. 105, 
107, 109, 111, 113, 115, 117, 119, 121, 123. 125, 127, 129, 131, 133, 135, 137, 
139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159. 161, 163, 165, 167, 169, 
171, 173. 175. 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199,201, 
203, 205, 207, 209. 211, 213. 215, 217, 219, 221, 223, 225. 227. 229, 231. 233. 

25 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 
267, 269, 271. 273, 275, 277, 279, 281, 283, 285, 287, 289. 291. 293. 295. 297. 
299 over a region of at least about 100 residues, as determined by analysis with a 
sequence comparison algorithm or by visual inspection, which retain at least one 
propCTtyofthepolypeptidesofSEQIDNOS: 2.4.6, 10, 12, 14, 16, 18,20,22, 

30 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60. 62. 64. 66. 
68, 70, 72, 74, 76, 78, 80. 82, 84, 86. 88. 90. 92. 94, 96, 98, 100, 102, 104, 106. 
108, 110, 112, 114, 116, 118, 120, 122, 124, 126. 128. 130. 132. 134. 136. 138, 
140, 142, 144, 146, 148, 150, 152, 154. 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 176, 178, 180, 182, 184, 186. 188. 190. 192. 194. 196. 198,202,204, 

35 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 
238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 
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5 270. 272, 274, 276, 278, 280. 282, 284. 286, 288. 290. 292, 294. 296. 298. and 
sequences having at least about 50% homology to SEQ ID NOS: 2, 4, 6, 10, 12, 
14. 16. 18. 20. 22. 24, 26. 28. 30, 32, 34, 36, 38. 40. 42, 44. 46, 48. 50. 52. 54, 56. 
58, 60. 62, 64. 66. 68. 70. 72. 74, 76, 78, 80. 82, 84. 86. 88. 90. 92. 94, 96. 98, 
100, 102, 104. 106, 108, 110, 112, 114, 116, 118. 120, 122. 124, 126, 128. 130. 

10 132, 134, 136. 138. 140. 142, 144, 146. 148, 150, 152. 154. 156. 158. 160, 162, 
164, 166, 168, 170, 172, 174. 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 202. 204. 206. 208. 210, 212, 214, 216, 218, 220. 222, 224. 226, 228. 
230. 232. 234, 236, 238, 240, 242, 244, 246. 248, 250. 252. 254, 256, 258, 260, 
262, 264, 266, 268, 270, 272, 274, 276, 278. 280, 282. 284. 286. 288. 290. 292, 

1 5 294, 296, 298. over a region of at least about 1 00 residues, as determined by 

analysis with a sequoice comparison algorithm or by visual inspection, said assay 
comprising the steps of: 

contacting the polypeptide of SEQ ID NOS: 2. 4, 6. 10, 12. 14. 16. 18. 20, 22, 
24. 26, 28. 30, 32, 34. 36. 38, 40. 42, 44. 46. 48. 50, 52. 54, 56, 58. 60, 62. 64, 66, 

20 68. 70, 72, 74. 76. 78. 80. 82, 84. 86. 88. 90, 92, 94. 96, 98. 100, 102, 104, 106, 
108, 110, 112. 114, 116, 118, 120. 122, 124. 126, 128. 130. 132. 134, 136. 138. 
140, 142, 144, 146, 148, 150, 152, 154, 156. 158, 160. 162. 164. 166, 168, 170. 
172. 174, 176, 178, 180, 182, 184, 186, 188, 190, 192. 194, 196. 198,202,204, 
206, 208, 210, 212, 214, 216, 218, 220. 222. 224, 226, 228, 230. 232, 234, 236, 

25 238, 240, 242. 244, 246, 248, 250, 252, 254, 256, 258, 260. 262. 264, 266, 268, 
270, 272, 274, 276. 278. 280. 282, 284, 286. 288, 290, 292, 294. 296, 298, and 
sequences having at least about 50% homology to SEQ ID NOS: 2, 4, 6. 10, 12, 
14, 16. 18, 20, 22, 24. 26, 28, 30, 32, 34. 36, 38. 40, 42, 44, 46, 48, 50, 52, 54. 56. 
58. 60. 62, 64. 66, 68. 70, 72, 74, 76, 78. 80, 82. 84, 86, 88, 90, 92, 94, 96, 98, 

30 100. 102, 104, 106, 108, 1 10, 1 12. 1 14, 1 16, 1 18. 120. 122, 124, 126, 128, 130, 
132, 134. 136. 138. 140, 142, 144, 146, 148. 150, 152, 154, 156, 158, 160, 162, 
164, 166. 168, 170, 172. 174. 176, 178, 180, 182, 184. 186. 188. 190. 192. 194, 
196, 198, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220. 222, 224, 226, 228, 
230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250. 252. 254. 256. 258, 260, 

35 262, 264, 266, 268, 270, 272, 274, 276. 278, 280, 282, 284. 286, 288. 290. 292. 
294, 296. 298 over a region of at least about 100 residues, as determined by 
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S analysis with a sequence comparison algorithm or by visual inspection, 

polypeptide fragments or variants encoded by SEQ ID NOS: 1, 3, 5, 9, 11, 13, 15, 
17. 19, 21, 23, 25, 27. 29. 31. 33. 35. 37. 39. 41, 43, 45 ,47. 49. 51, 53, 55, 57, 59, 
61, 63. 65. 67, 69, 71, 73, 75, 77, 79, 81. 83. 85, 87, 89, 91, 93. 95. 97, 99, 101. 
103. 105. 107. 109, III. 113, 115. 117, 119, 121, 123. 125, 127. 129. 131. 133. 

10 135, 137, 139, 141, 143, 145, 147, 149. 151, 153, 155, 157, 159, 161, 163, 165. 
167. 169. 171. 173. 175. 177, 179. 181, 183. 185. 187. 189, 191. 193. 195. 197, 
199, 201, 203, 205, 207, 209. 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 
231, 233, 235, 237, 239, 241, 243. 245, 247, 249, 251, 253, 255, 257, 259. 261. 
263, 265, 267, 269, 271, 273. 275, 277. 279. 281. 283, 285, 287, 289, 291. 293. 

15 295, 297, 299, sequences having at least about 50% homology to SEQ ID NOS: 1, 
3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29. 31, 33, 35, 37, 39. 41. 43. 45 .47, 49. 
51, 53, 55, 57, 59, 61, 63. 65. 67, 69, 71, 73. 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 
95,97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159. 

20 161. 163. 165. 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 
193, 195, 197, 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 223, 
225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249. 251, 253, 255, 
257, 259, 261. 263, 265, 267, 269. 271. 273. 275. 277. 279, 281, 283, 285, 287, 
289, 291, 293, 295, 297, 299 over a region of at least about 100 residues, as 

25 determined by analysis with a sequence comparison algorithm or by visual 

inspection, and sequences complementary to any of the foregoing sequences, with 
a substrate molecule under conditions which allow the particular polypeptide to 
function; and 

detecting either a decrease in an amount of a substrate or an increase in an 
30 amount of a reaction product which results from a reaction between said 

polypeptide and said substrate; wherein a decrease in the amount of the substrate 
or an increase in the amount of the reaction product is indicative of existoice of 
the functional polypeptide. 

35 74. A nucleic acid probe comprising an oligonucleotide from about 10 to 50 

nucleotides in length and having a segment of at least 10 contiguous nucleotides 
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5 that is at least 50% complementary to a nucleic acid target region of a nucleic acid 
sequence selected from the group consisting of SEQ ID NOS.: 1, 3. 5, 9, 1 1, 13, 
15. 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57. 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93. 95, 97. 99, 
101, 103. 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129. 131. 

10 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 
165, 167, 169, 171, 173. 175. 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 
197, 199, 201, 203, 205, 207, 209, 21 1. 213, 215, 217, 219. 221, 223, 225, 227, 
229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 
261, 263, 265, 267, 269, 271, 273, 275. 277. 279, 281, 283, 285, 287, 289, 291, 

15 293, 295, 297, 299; and which hybridizes to the nucleic acid target region under 
moderate to highly stringent conditions to form a detectable target:probe duplex. 

75. The probe of claim 74, wherein the oligonucleotide is DNA, 

20 76. The probe of claim 74, wherein the oligonucleotide has a segment of at least 10 
contiguous nucleotides that is at least 55% complementary to the nucleic acid 
target region. 

77. The probe of claim 74, wherein the oligonucleotide has a segment of at least 10 
25 contiguous nucleotides that is at least 60% complementary to the nucleic acid 

target region. 

78. The probe of claim 74, wherem the oligonucleotide has a segment of at least 10 
contiguous nucleotides that is at least 65% complementary to the nucleic acid 

30 target region. 

79. The probe of claim 74, wherein the oligonucleotide has a segment of at least 10 
contiguous nucleotides that is at least 70% complementary to the nucleic acid 
target region. 

35 
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S 80. The probe of claim 74, whmin the oligonucleotide has a segment of at least 10 
contiguous nucleotides that is at least 75% complementary to the nucleic acid 
target region. 

81. The probe of claim 74, wherein the oligonucleotide has a segment of at least 10 
10 contiguous nucleotides that is at least 80% complementary to the nucleic acid 

target region. 

82. The probe of claim 74, wherein the oligonucleotide has a segment of at least 10 
contiguous nucleotides that is at least 85% complementary to the nucleic acid 

15 target region. 

83. The probe of claim 74, wherein the oligonucleotide has a segment of at least 10 
contiguous nucleotides that is at least 90% complementary to the nucleic acid 
target region. 

20 

84. The probe of claim 74, wherein the oligonucleotide has a segment of at least 10 
contiguous nucleotides that is at least 95% complementary to the nucleic acid 
target region. 

25 85. The probe of claim 74, wherein the oligonucleotide has a segment of at least 10 
contiguous nucleotides that is fully complementary to the nucleic acid target 
region. 

86. The probe of claim 74, wherein the oligonucleotide is 15-50 bases in length. 

30 

87. The probe of claim 74, wherein the probe further comprises a detectable isotopic 
label. 

88. The probe of claim 74, wherein the probe further comprises a detectable non- 
35 isotopic label selected from the group consisting of a fluorescent molecule, a 
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5 chemiluminescent molecule, an enzyme, a cofactor> an enzyme substrate, and a 
hapten. 

89. The probe of claim 86, wherein the oligonucleotide has a segment of at least IS 
contiguous nucleotides that is at least 90% complementary to the nucleic acid 

10 target region, and which hybridizes to the nucleic acid target region under 

moderate to highly stringent conditions to form a detectable target:probe duplex. 

90. The probe of claim 86, wherein the oligonucleotide has a segment of at least IS 
contiguous nucleotides that is at least 9S% complementary to a nucleic acid target 

1 5 region, and which hybridizes to the nucleic acid target region under moderate to 
highly stringent conditions to form a detectable target:probe duplex. 

91. The probe of claim 86, wherein the oligonucleotide has a segment of at least IS 
contiguous nucleotides that is at least 97% complementary to a nucleic acid target 

20 region, and which hybridizes to the nucleic acid target region under moderate to 
highly stringent conditions to form a detectable target:probe duplex. 

92. A polynucleotide probe for isolation or identification of alpha amylase genes 
having a sequence which is the same as, or fully complementary to at least a 

25 fragment of one of SEQ ID NOS.: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 
31, 33, 35, 37, 39, 41. 43, 45 ,47, 49, 51, 53. 55, 57. 59, 61, 63, 65, 67, 69, 71, 73, 
75. 77, 79, 81, 83, 85, 87, 89, 91. 93, 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1. 
113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137. 139. 141, 143, 
145. 147, 149, 151, 153, 155, 157, 159, 161, 163, 165. 167, 169, 171. 173. 175, 

30 177, 179, 181, 183, 185, 187, 189. 191, 193, 195, 197, 199, 201, 203, 205, 207, 
209, 211, 213, 215, 217, 219. 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 
241, 243, 245. 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275. 277, 279, 281, 283, 285, 287, 289. 291, 293, 295, 297. 299. 
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5 93. A protein preparation comprising a polypeptide selected from the group consisting 
of: polypeptides having an amino acid sequence selected from the group 
consisting of: 

SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34. 
36. 38, 40, 42. 44, 46, 48, 50. 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76. 

10 78, 80, 82, 84. 86. 88. 90. 92, 94. 96. 98, 100, 102, 104, 106. 108, 1 10, 1 12, 

114, 116, 118, 120, 122, 124, 126. 128, 130, 132, 134, 136, 138, 140, 142, 
144, 146, 148, 150, 152, 154, 156, 158. 160. 162. 164. 166. 168, 170, 172, 
174, 176, 178, 180, 182, 184. 186, 188, 190, 192, 194, 196, 198. 202, 204, 
206, 208, 210, 212, 214, 216, 218. 220. 222. 224. 226, 228, 230, 232, 234, 

15 236, 238, 240, 242, 244, 246. 248, 250, 252, 254, 256. 258, 260, 262, 264, 

266, 268, 270, 272, 274, 276, 278. 280, 282, 284. 286, 288, 290, 292, 294. 
296,298; 

variants having at least about 50% homology to at least one of SEQ ID 
NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20. 22. 24, 26, 28, 30. 32. 34, 36. 38, 40, 42, 
20 44, 46, 48. 50. 52. 54. 56, 58, 60, 62. 64. 66, 68. 70. 72, 74, 76. 78. 80. 82. 84, 

86, 88, 90, 92, 94. 96. 98, 100. 102. 104. 106. 108, 110, 1 12, 1 14. 116, 118. 
120. 122, 124, 126, 128, 130. 132. 134, 136, 138, 140. 142, 144, 146, 148, 
150, 152, 154, 156, 158, 160. 162, 164. 166. 168, 170. 172, 174, 176, 178, 
180, 182, 184. 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210. 
25 212, 214. 216. 218, 220, 222. 224, 226, 228. 230, 232, 234, 236, 238, 240, 

242. 244. 246, 248, 250, 252, 254, 256. 258. 260, 262, 264, 266, 268, 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288. 290. 292. 294, 296, 298, over a 
region of at least about 100 residues, as determined by analysis with a 
sequence comparison algorithm or by visual inspection; 
30 sequences complementary to SEQ ID NOS: 2, 4. 6. 10. 12, 14, 1 6, 1 8, 

20. 22. 24. 26, 28, 30, 32. 34, 36. 38. 40, 42, 44, 46, 48, 50. 52, 54. 56. 58. 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82. 84, 86, 88. 90. 92. 94, 96, 98, 100, 102, 104. 106. 
108, 110, 112, 114, 116, 118. 120, 122. 124, 126, 128, 130, 132, 134. 136. 138. 140, 
142, 144, 146, 148, 150, 152, 154. 156. 158, 160, 162, 164, 166, 168, 170, 172, 174, 
35 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196. 198. 202, 204, 206, 208, 210, 
212, 214. 216, 218. 220, 222. 224, 226, 228, 230. 232, 234, 236. 238, 240, 242. 244. 
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5 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292. 294, 296, 298; and 

sequences complementary to variants having at least about 50% 
homology to SEQ ID NOS: 2, 4, 6, 10, 12. 14, 16, 18, 20, 22, 24, 26, 28, 30, 
32, 34, 36, 38, 40, 42. 44, 46, 48. 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70. 72, 
10 74. 76. 78, 80, 82, 84. 86. 88, 90, 92, 94. 96. 98, 100, 102. 104. 106. 108, 1 10. 

112. 114, 116. 118, 120, 122, 124, 126. 128. 130. 132, 134, 136, 138. 140, 
142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,202, 
204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224. 226, 228. 230, 232. 
15 234, 236, 238, 240, 242, 244, 246. 248, 250. 252, 254, 256, 258, 260, 262, 

264, 266. 268. 270, 272, 274. 276, 278, 280, 282, 284, 286. 288. 290. 292, 
294, 296. 298 over a region of at least about 100 residues, as determined by 
analysis with a sequence comparison algorithm or by visual inspection; and 
polypeptides having at least 10 consecutive amino acids of a polypeptide having a 
20 sequence selected from the group consisting of SEQ ID NOS: 2, 4, 6, 10. 12. 14, 16, 
18. 20, 22, 24, 26, 28, 30. 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52. 54, 56, 58, 60, 62. 
64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88. 90. 92, 94, 96. 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168. 170, 172. 
25 174, 176, 178, 180, 182, 184. 186. 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 
210. 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248. 250. 252, 254, 256. 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298; and 
wherein the protein preparation is a liquid. 

30 

94. A protein preparation comprising a polypeptide selected from the group consisting 
of: polypeptides having an amino acid sequence selected from the group 
consisting of: 

SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
35 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62. 64, 66, 68, 70, 72, 74, 76, 

78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 
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5 114, 116. 118, 120. 122, 124. 126, 128, 130, 132. 134, 136, 138, 140. 142, 

144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168. 170, 172, 
174, 176, 178, 180. 182, 184, 186, 188, 190, 192, 194, 196. 198, 202, 204, 
206, 208, 210, 212, 214, 216, 218, 220, 222. 224, 226. 228, 230. 232. 234. 
236, 238, 240. 242. 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 
10 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 

296, 298; 

variants having at least about 50% homology to at least one of SEQ ID 
NOS: 2, 4, 6, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 
44, 46, 48, 50, 52, 54, 56. 58, 60, 62. 64. 66. 68, 70, 72. 74, 76, 78, 80, 82, 84, 
15 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 118, 

120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 
150, 152, 154, 156, 158, 160, 162, 164, 166, 168. 170, 172, 174, 176, 178, 
180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 
212, 214, 216, 218, 220, 222, 224, 226, 228, 230. 232, 234. 236, 238. 240, 
20 242. 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 

272, 274, 276, 278, 280, 282, 284, 286, 288. 290. 292, 294. 296. 298, over a 
region of at least about 100 residues, as determined by analysis with a 
sequence comparison algorithm or by visual inspection; 

sequences complementary to SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 18, 
25 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46. 48, 50. 52, 54, 56. 58. 60, 62, 64, 
66, 68, 70, 72, 74. 76. 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102. 104, 106. 
108. 110. 112. 114, 116. 118. 120. 122. 124, 126, 128, 130, 132, 134, 136, 138, 140, 
142, 144, 146. 148. 150, 152. 154, 156, 158, 160, 162. 164, 166. 168, 170, 172, 174, 
176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 202, 204, 206, 208, 210, 
30 212, 214, 216, 218, 220. 222, 224. 226, 228, 230, 232, 234, 236, 238, 240, 242. 244, 
246, 248. 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276. 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298; and 

sequences complementary to variants having at least about 50% 
homology to SEQ ID NOS: 2, 4. 6. 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 
35 32, 34. 36, 38, 40, 42. 44. 46. 48. 50. 52, 54, 56, 58, 60, 62, 64, 66. 68, 70, 72, 

74, 76, 78. 80. 82, 84, 86, 88, 90, 92, 94. 96, 98, 100, 102, 104, 106, 108, 1 10, 
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5 112, 114. 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 

142, 144, 146, 148, 150. 152, 154, 156, 158, 160. 162. 164, 166, 168, 170, 
172. 174. 176. 178, 180, 182. 184, 186, 188, 190, 192, 194. 196. 198, 202. 
204. 206, 208, 210, 212. 214. 216. 218, 220, 222, 224. 226. 228, 230, 232, 
234,236, 238. 240. 242, 244. 246, 248, 250, 252, 254, 256, 258. 260. 262, 
10 264, 266, 268, 270. 272, 274, 276, 278, 280, 282, 284, 286, 288, 290. 292, 

294, 296, 298 over a region of at least about 100 residues, as determined by 
analysis with a sequence comparison algorithm or by visual inspection; and 
polypeptides having at least 10 consecutive amino acids of a polypeptide having a 
sequence selected from the group consisting of SEQ ID NOS: 2, 4, 6, 10, 12, 14, 16, 
15 18, 20, 22, 24, 26. 28, 30, 32, 34. 36, 38, 40. 42. 44, 46, 48, 50, 52, 54, 56, 58. 60. 62. 
64, 66. 68, 70, 72, 74. 76, 78. 80, 82, 84. 86, 88. 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116. 118, 120. 122, 124, 126, 128, 130, 132. 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154. 156, 158, 160, 162, 164, 166, 168, 170, 172, 
174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,202,204,206,208, 
20 210, 212, 214, 216, 218, 220, 222, 224. 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292, 294, 296. 298; and 
wherein the polypeptide is a solid. 

25 95. A method for modifying small molecules, comprising the step of mixing at least 
one polypeptide encoded by a polynucleotide selected from the group consisting 
of: 

SEQ ID NOS.: 1, 3, 5, 9, 11, 13, 15, 17, 19. 21. 23, 25. 27, 29, 31, 33, 35, 37, 
39, 41, 43, 45 ,47, 49, 51, 53, 55, 57. 59, 61, 63, 65, 67, 69, 71. 73. 75, 77, 79, 81, 83, 

30 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111. 113, 115, 117, 119, 121, 
123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
157, 159, 161, 163, 165, 167, 169, 171, 173, 175. 177, 179, 181, 183, 185, 187, 189, 
191, 193, 195, 197, 199,201,203,205,207,209.211,213,215,217,219,221,223, 
225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 

35 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283. 285, 287, 289, 291, 
293, 295, 297, 299; 
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5 variants having at least about 50% homology to at least one of SEQ ID NOS.: 

1, 3, 5. 9. 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 
51, 53, 55. 57, 59, 61. 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 
97.99, 101, 103. 105. 107. 109. Ill, 113, 115, 117, 119, 121, 123, 125, 127, 129, 
131. 133, 135, 137, 139, 141, 143. 145, 147, 149, 151, 153, 155, 157, 159. 161, 163, 

10 165. 167. 169. 171, 173. 175. 177. 179, 181, 183, 185, 187. 189, 191, 193, 195, 197, 
199, 201, 203, 205, 207. 209. 21 1, 213, 215, 217. 219. 221, 223, 225, 227, 229, 231, 
233, 235, 237, 239. 241, 243. 245. 247. 249, 25 1. 253. 255. 257, 259, 261, 263, 265, 
267, 269, 271, 273, 275, 277, 279, 281, 283. 285. 287, 289. 291, 293, 295. 297, 299, 
over a region of at least about 100 residues, as determined by analysis with a sequence 

1 5 comparison algorithm or by visual inspection; 

sequences complementary to SEQ ID NOS.: 1, 3, 5, 9, 1 1. 13, 15. 17, 19, 21, 
23, 25, 27, 29, 31, 33. 35. 37, 39. 41, 43. 45 ,47. 49. 51, 53. 55. 57. 59, 61, 63, 65. 67. 
69. 71. 73. 75. 77, 79. 81, 83, 85, 87. 89, 91. 93, 95. 97. 99. 101. 103. 105, 107. 109, 
111, 113, 115, 117, 119, 121, 123. 125, 127, 129, 131, 133, 135, 137. 139, 141. 143. 

20 145. 147. 149, 151. 153. 155. 157, 159, 161, 163. 165. 167. 169, 171, 173. 175, 177. 
179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211. 
213, 215. 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 
247, 249, 251, 253, 255, 257, 259. 261. 263. 265, 267, 269, 271, 273. 275. 277, 279. 
281. 283, 285, 287. 289, 291, 293, 295, 297, 299; and 

25 sequences complementary to variants having at least about 50% homology to 

SEQ ID NOS.: 1, 3. 5, 9, 11. 13. 15. 17, 19, 21. 23, 25. 27, 29, 31. 33, 35. 37. 39, 41, 
43, 45 .47. 49. 51. 53. 55, 57. 59. 61. 63, 65, 67, 69. 71, 73, 75, 77. 79, 81, 83, 85, 87, 
89, 91. 93.95.97. 99. 101. 103. 105. 107, 109. Ill, 113. 115, 117. 119. 121. 123, 125, 
127, 129, 131, 133. 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 

30 161, 163, 165, 167, 169, 171, 173, 175. 177. 179. 181. 183. 185, 187. 189, 191. 193. 
195. 197, 199. 201. 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225. 227, 
229, 231, 233, 235, 237. 239. 241, 243, 245, 247, 249. 251. 253, 255. 257, 259, 261. 
263, 265, 267. 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 
297, 299, over a region of at least about 100 residues, as determined by analysis with 

35 a sequence comparison algorithm or by visual inspection; and fragmoits of any of the 
foregoing polypeptides; 
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S with at least one small molecule to produce at least one modified small molecule via 
at least one biocatalytic reaction, wherein the at least one polypeptide has alpha 
amylase activity. 

96. The method of claim 95, wherein the at least one polypeptide comprises a 
10 plurality of polypeptides and the at least one small molecule comprises a plurality 
of small molecules, whereby a plurality of modified small molecules are produced 
via a plurality of biocatalytic reactions to form a library of modified small 
molecules. 

IS 97. The method of 96, further comprising the step of testing the library to determine if 
a particular modified small molecule, which exhibits a desired activity is present 
within the library. 

98. The method of claim 97 wherein the step of testing the library further comprises 
20 the steps of: 

systematically eliminating all but one of the biocatalytic reactions used to 
produce a portion of the plurality of the modified small molecules within the library 
by testing the portion of the modified small molecule for the presence or absence of 
the particular modified small molecule with the desired activity, and identifying a 
25 specific biocatalytic reaction which produces the particular modified small molecule 
of desired activity. 

99. The method of claim 98 wherein the specific biocatalytic reaction, which produces 
the modified small molecule of desired activity is repeated. 

30 100. The method of claim 93 wherein the biocatalytic reactions are conducted with 

a group of biocatalysts that react with distinct structural moieties found within the at 

least one small molecule; 

each biocatalyst is specific for a particular structural moiety or a group of 

related structural moieties; and 
35 each biocatalyst reacts with a plurality of small molecules which contain the 

particular structural moiety specific to the particular biocatalyst. 



wo 02/068589 



PCT/US02/05068 



178 

5 

101. A cloning vector comprising a sequence that encodes a polypeptide having alpha 
amylase activity, said sequence being selected from the group consisting of: 

SEQ ID NOS.: 1, 3. 5, 9, 1 1, 13, 15, 17, 19. 21, 23, 25, 27, 29, 31. 33, 35, 37, 
39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 

10 85,87,89,91,93,95,97,99, 101. 103, 105, 107, 109, 111, 113. 115, 117, 119, 121, 
123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, lisi, 153, 155, 
157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 
191, 193, 195. 197. 199. 201. 203. 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 
225, 227, 229, 23 1, 233, 235, 237. 239, 241, 243, 245, 247, 249, 25 1, 253, 255, 257, 

15 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291. 
293,295,297,299; 

variants having at least about 50% homology to at least one of SEQ ID NOS.: 
1.3,5,9, 11, 13. 15, 17, 19,21,23,25,27.29.31.33.35,37,39,41,43.45,47,49, 
51, 53, 55, 57, 59, 61, 63. 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87. 89, 91, 93, 95, 

20 97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123. 125, 127, 129. 
131. 133, 135. 137, 139, 141, 143, 145, 147, 149. 151, 153, 155, 157, 159, 161, 163, 
165, 167, 169, 171, 173, 175, 177, 179. 181, 183, 185, 187, 189, 191, 193, 195, 197, 
199, 201. 203, 205, 207, 209, 211, 213. 215, 217, 219, 221, 223, 225, 227, 229, 231, 
233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261. 263. 265, 

25 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 
over a region of at least about 100 residues, as determined by analysis with a sequence 
comparison algoridmi or by visual inspection; 

sequences complementary to SEQ ID NOS.: 1, 3, 5, 9, 1 1, 13, 15, 17. 19, 21, 
23, 25, 27, 29, 31, 33, 35. 37, 39, 41, 43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 

30 69. 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 
111, 113, 115, 117, 119, 121, 123, 125, 127, 129. 131, 133, 135, 137. 139, 141, 143, 
145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165. 167, 169, 171, 173, 175, 177, 
179, 181, 183, 185. 187, 189, 191, 193, 195, 197, 199,201,203,205, 207,209,211, 
213, 215, 217. 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 

35 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271. 273. 275, 277, 279, 
281, 283, 285, 287, 289, 291, 293, 295, 297, 299; and 
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5 sequences complementary to variants having at least about 50% homology to 

SEQ ID NOS.: 1, 3. 5. 9, 11, 13. 15, 17, 19, 21, 23, 25, 27, 29. 31, 33, 35, 37, 39, 41. 
43, 45 ,47, 49, 51, 53, 55, 57, 59, 61, 63. 65, 67, 69. 71, 73, 75, 77, 79, 81, 83, 85, 87, 
89,91, 93,95,97,99, 101. 103. 105. 107, 109. Ill, 113. 115, 117, 119, 121, 123, 125, 
127, 129. 131. 133, 135, 137. 139, 141, 143, 145, 147, 149, 151, 153. 155, 157, 159. 

10 161. 163, 165, 167. 169. 171, 173. 175, 177, 179. 181. 183. 185. 187. 189. 191. 193, 
195. 197. 199, 201, 203, 205. 207. 209. 211, 213. 215, 217, 219. 221, 223, 225, 227, 
229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 
263, 265, 267. 269. 271. 273, 275. 277. 279, 281. 283. 285, 287, 289, 291, 293. 295, 
297, 299, over a region of at least about 100 residues, as determined by analysis with 

15 a sequence comparison algorithm or by visual inspection. 

102. A host cell comprising a sequence that encodes a polypeptide having alpha 
amylase activity, said sequence being selected from the group consisting of: 

SEQ ID NOS.: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31. 33. 35, 37, 

20 39, 41, 43, 45 ,47, 49, 51. 53, 55, 57, 59, 61, 63, 65, 67, 69, 71. 73, 75, 77. 79, 81, 83, 
85, 87. 89. 91. 93, 95. 97, 99, 101, 103, 105, 107, 109,1 1 1, 1 13, 1 15, 1 17, 1 19, 121, 
123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149. 151, 153, 155, 
157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 
191, 193, 195, 197, 199,201,203,205,207,209,211,213,215,217,219,221,223, 

25 225, 227, 229, 231, 233. 235, 237, 239, 241, 243, 245, 247. 249. 251. 253, 255, 257, 
259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281. 283, 285, 287, 289, 291, 
293,295, 297,299; 

variants having at least about 50% homology to at least one of SEQ ID NOS.: 
1, 3, 5, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27. 29. 31. 33. 35. 37. 39. 41. 43,45 ,47, 49, 

30 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 
97,99, 101, 103, 105, 107, 109, 111. 113. 115, 117, 119, 121, 123, 125, 127, 129, 
131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 
165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 
199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 

35 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 
267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289. 291. 293. 295. 297. 299, 
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5 over a region of at least about 1 00 residues, as determined by analysis with a sequence 
comparison algorithm or by visual inspection; 

sequences complementary to SEQ ID NOS.: 1, 3, 5, 9, 1 1, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51. 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71. 73, 75, 77. 79, 81, 83, 85, 87, 89, 91, 93. 95. 97. 99, 101. 103. 105. 107, 109. 

10 111, 113. 115. 117, 119, 121, 123.125, 127, 129, 131. 133. 135, 137. 139, 141. 143, 
145, 147. 149. 151, 153, 155, 157, 159. 161. 163, 165. 167, 169. 171, 173, 175. 177, 
179, 181, 183, 185, 187, 189, 191. 193, 195, 197. 199, 201, 203, 205. 207. 209, 21 1, 
213, 215, 217, 219. 221. 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 
247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269. 271, 273, 275, 277, 279, 

15 281, 283, 285, 287, 289, 291, 293, 295, 297, 299; and 

sequences complementary to variants having at least about 50% homology to 
SEQ ID NOS.: 1, 3, 5, 9, 11, 13, 15, 17, 19, 21, 23. 25, 27, 29, 31, 33, 35. 37, 39, 41, 
43, 45 ,47. 49. 51, 53, 55, 57, 59, 61, 63, 65. 67, 69, 71, 73, 75, 77, 79. 81, 83, 85, 87, 
89,91, 93, 95,97, 99, 101, 103. 105, 107, 109, 111, 113, 115, 117, 119. 121, 123, 125, 

20 127, 129, 131, 133, 135, 137, 139. 141, 143, 145, 147, 149, 151, 153, 155. 157, 159, 
161, 163. 165. 167. 169, 171, 173, 175, 177. 179, 181, 183, 185, 187. 189, 191, 193, 
195, 197. 199, 201, 203, 205, 207, 209, 211, 213, 215. 217. 219, 221, 223, 225, 227, 
229, 231, 233, 235. 237. 239. 241. 243. 245, 247, 249, 251, 253, 255. 257, 259, 261. 
263, 265, 267, 269, 271, 273, 275, 277. 279. 281, 283, 285, 287, 289. 291, 293, 295, 

25 297. 299, over a region of at least about 100 residues, as determined by analysis with 
a sequence comparison algorithm or by visual inspection. 

103. An expression vector capable of replicating in a host cell comprising a 
polynucleotide having a sequence selected from the group consisting of SEQ ID 

30 NOS.: 1, 3, 5, 9. 11, 13, 15, 17. 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39,41,43, 45 
,47, 49, 51, 53, 55, 57, 59, 61. 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 
93,95,97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129. 131. 133, 135, 137, 139, 141. 143, 145, 147, 149. 151, 153, 155, 157, 159, 161, 
163, 165, 167, 169, 171, 173. 175, 177, 179, 181, 183, 185, 187, 189, 191, 193. 195, 

35 197, 199, 201, 203. 205, 207, 209, 211. 213, 215, 217, 219, 221, 223, 225, 227, 229. 
231. 233, 235, 237, 239, 241, 243, 245, 247. 249, 251, 253, 255, 257, 259, 261, 263. 
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5 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 
299, variants having at least about 50% homology to SEQ ID NOS.: 1, 3, 5, 9, 1 1, 13, 
15, 17. 19. 21. 23. 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51. 53, 55, 57, 59. 
61, 63. 65. 67. 69. 71. 73. 75. 77. 79. 81. 83. 85. 87, 89, 91. 93, 95, 97, 99, 101, 103, 
105. 107, 109, 111. 113. 115, 117. 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 

10 139, 141, 143, 145. 147. 149. 151, 153. 155, 157. 159, 161, 163, 165, 167, 169, 171, 
173, 175, 177. 179. 181. 183. 185. 187. 189. 191. 193. 195. 197. 199. 201, 203, 205, 
207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229. 231. 233, 235, 237. 239, 
241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 
275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299 over a region of at 

1 5 least about 100 residues, as determined by analysis with a sequence comparison 

algorithm or by visual inspection, sequences complementary to SEQ ID NOS.: 1 , 3, 5, 
9, 1 1, 13, 15. 17. 19. 21. 23, 25; 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 ,47, 49, 51, 53, 
55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77. 79. 81, 83, 85, 87, 89, 91, 93, 95. 97, 99, 
101, 103, 105, 107, 109, 111, 113, 115. 117. 119. 121. 123. 125. 127. 129, 131, 133, 

20 135, 137, 139, 141, 143, 145, 147, 149, 151, 153. 155. 157, 159. 161, 163. 165. 167. 
169. 171, 173, 175, 177, 179, 181, 183, 185, 187, 189. 191. 193. 195. 197. 199.201, 
203, 205, 207, 209. 211. 213. 215. 217. 219, 221. 223, 225, 227. 229, 231, 233, 235, 
237, 239, 241, 243, 245, 247, 249, 251, 253. 255. 257, 259, 261, 263, 265, 267, 269. 
271. 273, 275. 277, 279. 281. 283. 285, 287, 289, 291, 293, 295, 297, 299, and 

25 sequences complementary to variants having at least about 50% homology to SEQ ID 
NOS.:.l, 3, 5, 9, 1 1, 13, 15, 17. 19. 21. 23. 25, 27, 29, 31, 33, 35, 37. 39, 41, 43, 45 
.47, 49. 51. 53. 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 
93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115. 117. 119. 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 

30 163, 165, 167, 169, 171, 173, 175, 177, 179. 181, 183, 185, 187, 189, 191. 193, 195. 
197. 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229. 
231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253. 255, 257, 259, 261, 263, 
265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 
299 over a region of at least about 100 residues, as determined by analysis with a 

35 sequence comparison algorithm or by visual inspection, and isolated nucleic acids that 
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5 hybridize to nucleic acids having any of the foregoing sequences under conditions of 
low, moderate and high stringency. 

104. A vector as claimed in claim 101 or 103, wherein the vector is selected from 
the group consisting of viral vectors, plasmid vectors, phage vectors, phagemid - 

10 vectors, cosmids, fosmids, bacteriophages, artificial chromosomes, adenovirus 
vectors, retroviral vectors, and adeno-associated viral vectors. 

105. A host cell comprising an expression vector as claimed in claim 103. 

IS 106. A host cell as claimed in claim 47, 102, 103 or lOS, wherein the host is 

selected from the group consisting of prokaryotes, eukaryotes, funguses, yeasts, plants 
and metabolically rich hosts. 

107. A method for liquifying a starch containing composition comprising 
20 contacting the starch with a polypeptide of claim 30. 

108. A liquified syrup produced by the method of claim 107. 

109. A liquified syrup having the oligosaccharide profile of the syrup of claim 108. 

25 

110. A liquified syrup having the NN/MW ration of less than 2.0 at 1 8 DE and less 
than about 4.0 at 12 DE. 

111. A method for washing an object comprising contacting said object with a 
30 polypeptide of claim 30 under conditions sufficient for said washing. 

112. A method for textile desizing comprising contacting said textile with a 
polypeptide of claim 30 under conditions sufficient for said desizing. 
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S 113. A method for the treatment of lignocellulosic fibers, wherein the fibers are 
treated with a polypeptide of claim 30, in an amount which is efficient for improving 
the fiber properties. 

114. A method according to claim 1 1 3 for enzymatic deinking of recycled paper 
10 pulp, wherein the polypeptide of claim 30 is applied in an amount which is efficient 
for effective deinking of the fiber surface. 

lis. A method for starch liquefaction comprising contacting said starch with with a 
polypeptide of claim 30 under conditions sufficient for said liquefaction. 

15 

116. A detergent additive comprising with a polypeptide of claim 30. 

117. A method for producing a high-maltose or a high-glucose syrup or a mixed 
syrup comprising: 

20 liquefying starch using an effective amount of a polypeptide of claim 30 to obtain a 
soluble starch hydrolysate; and 

saccharifying the soluble starch hydrolysate, thereby resulting in a syrup. 

1 1 8. The method as in any of claims 107, 1 1 S, or 1 1 7, wherein the starch is from a 
25 material selected from rice, germinated rice, com, barley, wheat, legumes and sweet 

potato. 

119. The method as in any of claims 107, 115, or 117, further comprising addition of a 
second alpha amylase or a beta amylase or a combination thereof 

30 

120. A method of increasing the flow of production fluids from a subterranean 
formation by removing a viscous, starch-containing, damaging fluid formed during 
production operations and found within the subterranean formation which surrounds a 
completed well bore comprising: 

35 allowing production fluids to flow from the well bore; 
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S reducing the flow of production fluids from the formation below expected 

flow rates; 

fonnulating an enzyme treatment by blending together an aqueous fluid and a 
polypeptide of claim 30; 

pumping the enzyme treatment to a desired location within the well bore; 
1 0 allowing the enzyme treatment to degrade the viscous, starch-containing, 

damaging fluid, whereby the fluid can be removed from the subterranean formation to 
the well surface; and 

wherein the enzyme treatment is effective to attack the alpha glucosidic 
linkages in the starch-containing fluid. 



15 
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DISPLAY HOMOLOGY LEVEL BETWEEN THE FIRST AND 
SECOND SEQUENCES 



278 

END ""^y^ 



FIGURE 2 
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300 
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(. START ^^y^ 



NO 



304 



STORE A FIRST SEQUENCE TO A MEMORY 



OPEN A DATABASE OF SEQUENCE FEATURES 



READ FIRST FEATURE FROM DATABASE 



306 

308 



COMPARE FEATURE ATTRIBUTES VmH THE FIRST 
SEQUENCE 



310 



READ NEXT 
FEATURE IN 
DATABASE 




326 



NO 
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( END 




Figure 5: Residual activity of various amylases following heating to 90X for 10 min. 
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Figure 6. Net percent starch removed vs. enzyme concentration in ADW wash test with 
bleach and chelators 
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Figure 7: Activity of parental amylases at pH 8, 40»C (black bare) in ADW fomulation 
at SS'C (gray bare). Values are tfie average of 384 wells with error bare representing the standard 
deviation. 
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Specific Activity (pH 8, 40 C) 
(witli activity >4 U/mg) 




Specific activity @ pH 10, 50C 
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Figure 10: Alignments of the genes proposed to be used in reassembly 



SEQ ID NO.: 113 (D 
SEQ ID N0.:127 (1> 
SEQ ID NO. : 115 (1) 



AANLN|Ti 

-QANTAPVNGTi 

akyseleqgI' 




D|TLgTg\^NgASglJSLilTglP^ 
GSIw5dtSS§KI PEWYlAgir^ 




(TS-QSDVp< 
iGGAYSg^DPi 




81 

SEQ ID NO.: 113 (76) |KygTpELQSA£KSLHSRDlg[VY^V 
SEQ ID NO.: 127 (79) ^KYGTKT^Qj^AAKSAGgQVY 
SEQ ID N0.:11S(81) ig^EEEI.2Ne^STAHQY6IKVlS 



160 



gTEOVTAVEVDPADRNgVI Sg^gl KAWTHFHFPGRGSTYSDFK 
RTEWVDAVEVNPSgRNQETSCTYQIQAWTKFOFPGRGNTYSSFK 
|LEHNPYV(^YT|Q|DPSSVASSKYgAHYKDFHPN 



161 

SEQ ID NO.:113(156) WHWYHFDGTgWDESgKLNR 
SEQ ID NO.: 127 (159) WRWYHFOGTBtfOESgKLNRl 
SEQ ID NO.:115(150) nSST^BGTFGGFPI^ 



jYKFQG - - KA|CWEVSKENGN YDYLMYADfDYDHPgVAAEgKRWG' 
!YKEgGTGKA|D§EV0gENGNY0YLMgADLDMDHPEVVTELKNWGT1 
pHLVPFHQYgliiASNES YF 



241 



SEQ ID NO. :113(234) 
SEQ ID NO. :127(239) 
SEQ ID MO. :11S(200) 



SEQ 10 NO. : 113.(312) 

SEQ ID NO. :127(317) 

SEQ ID NO.: 115 (272) |^1RD|i 



rtII^SFLgD|^HVREKTGKEg] 

filKYSFFPDgljHVRSQTRKNLFABG^SYDVWKit^YITKTS^si 
gGYGAWV^DiLiONGG - — - - - 






igGAfiEg^KTNPNHSl 
fK|l«YIT 



[DTMVOA|LS|AYSiG- -AE| 



FSD^^QPGQS lest VQT^FgP] 
"^D^^QPGQSLQSWV|p|FgP] 

--iiSnIypI 



320 

HYgFgAgSTQGGOYDMRKLLNG* - 
_„ WFYTgSKSSGl^DMRYLLN^ 
iFj^YRMDESFDHKNIPAgVYA 

400 

RESgYgQV^GlMYGTKGDSQ- -RBIPASKHKIEPgL 

RQE§YiCVg5GiYYGIPICYN IPgiKSKlgPLL 

YE-SqS^^^RIyEEWUIKD KLNNg 1 



401 480 

SEQ ID NO. : 1 1 3 ( 3 9 0 ) KARKQYi^YgAQnOYFDHH|lgGWT^D|sVANSGIJ^ITDGPGGAKRMYA^QNAGETWHDiTiNRS - -EPWIN^g 
SEQ ID NO.: 127 (392) I AR^Y5YpQ|DYIDHQ|lIGWT^IDS|PNSGLAAgITDGPGG|KWMYVgKKHAGK^ 
SEQ ID NO-: 115 (331) WIHEHLg<^TSlI/SYDD|Egl|M^YGDiPGL-|TT@n,GSDWgEiwgNvisKFAGYTIH|^^ 

481 560 

SEQ ID NO.: 113 (468) gGEFHVN gcgSlgVQR - -- 

SEQ ID NO.:127(470) gGEFKVN iGSjfS ifvSKTSQVTFTVNNATT I SGQNVYWGMI PEIX3NWNTANAI KMTPSSYPTWKATIALP 

SEQ ID NO . : lis ( 4 10 ) gVBXTAPPHDPANGYYgY^SjfAGVG - 



561 60S 

SEQ ID NO. :113 (485) - 

SEQ ID NO.: 127 (541) QGKAIEFKFIKKDQSGNWHESIPNRTYTVPFLSTGSYTASWNVP 
SEQ ID NO. :115 (437) 
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Figure 11: Example Standard Curve of the assay of Example S. 
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Figure 12: A graph of the pH rate profiles for 2 different amylases. BD71 88 is a control; an 
enzyme that was discovered previously and has a neutral pH optimum. BD7837 is a more recently 
discovered amylase and has an optimum around pH 10. Pure protein was used in these assays. 
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Figure 13: Stability of Diversa amylases vs. a commercial enzyme 
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Figure 14a: 

1 so 

SEQIDN0:81 ^KK FVALFITMFF WSMAW AQPASAAK 

pyro MKK FVALLITMFF WSMAAV AQPASAAK 

pyro2 VNIKK LTPLLTLLbF FI...VL ASPVSAAK 

thermo SESQCTATCT WRWYMSAKK LLALLFVLAV LVGVAVIPAR VGIAPVSAGA 

thermoa MA RKVLVALLVF LWLSVSAVP 

Consensus SA- - 

51 100 
SEQIDN0:81 YS..ELEEGG VIMQAFYWDV PGGGIWWDTI RSKIPEWYEA GISAXWIPPA 
pyro YS..ELEEGG VIMQAFYWDV PAGG I WW DTI RSKIPEWYEA GISAIWIPPA 
pyro2 YL..ELEEGG VIMQAFYWDV PGGGIWWDHI RSKIPEWYEA GISAIWLPPP 
thermo TSRPSLEEGG VIMQAFYWDV PAGGIWWDTI RSKIPDWASA GISAIWIPPA 
thermo2 AKAETLENGG VIMQAFYWDV PGGGIWWDTI AQKIPDWASA GISAIWIPPA 

Consensus LB-GG VIMQAFYWDV P-GGIWWD-I --KIP-W--A GISAIW-PP- 

Sense primer 

101 ISO 
SEQIDN0:81 SKGMSGGYSM GYDPYDFFDL GBYNQKGTIE TRFGSKQELI NMINTAHAYG 
pyro SKGMGGAYSM GYDPYDFFDL GEYNQKGTVE TRFGSKQELI NMINTAHAYG 
pyro2 SKGMSGGYSM GYDPYDYFDL GEYYQKGTVE TRFGSKEELV RLIQTAHAYG 
thermo SKGMSGAYSM GYDPYDFFDL GEYYQKGTVE TRFGSKQELI NMINTAHSYG 
thermo2 SKGMSGGYSM GYDPYDFFDL GEYYQKGSVE TRFGSKEELV NMINTAHAHN 
Consensus SKGM-G-YSM GYDPYD-FDL GEY-QKG--E TRFGSK-EL- --I~TAH 

151 200 

SEQIDN0:8l IKVIADIVIN HRAGGDLEWN PFVGDYTWTD FSKVASGKYT ANYLDFHPNE 

pyro IKVIADIVIN HRAGGDLEWN PFVGDYTWTD FSKVASGKYT ANYLDFHPNE 

pyro2 IKVIADWIN HRAGGDLEWN PFVGDYTWTD FSKVASGKYT ANYLDFHPNE 

thermo IKVIADIVIN HRAGGDLEWN PFTNSYTWTD FSKVASGKYT ANYLDFHPNE 

thermo2 MKVIADIVIN HRAGGDLEWN PFTNSYTWTD FSKVASGKYT ANYLDFHPNE 

Consensus -KVIAD-VIN HRAGGDLEWN PF YTWTD FSKVASGKYT ANYLDFHPNE 

201 25 0 

SEQIDN0:8I VKCCDEGTFG GFPDIAHEKS WDQHWLWASD ESYAAYLRSI GVDAWRFDYV 

pyro VKCCDEGTFG GFPDIAHEKE WDQHWLWASD ESYAAYLRSI GVDAWRFDYV 

pyro2 LHCCDEGTFG GFPDICHHKE WDQYWLWKSN ESYAAYLRSI GFDGWRFDYV 

thermo VKCCDEGTFG GFPDIAHEKS WDQYWLWASQ KSYAAYLRSI GIDAWRFDYV 

thermo2 LHAGDSGTFG GYPDICHDKS WDQHWLWASN ESYAAYLRSI GIDAWRFDYV 

Consensus D-GTFG G-PDI-H-K- WDQ-WLW-S- -SYAAYLRSI G-D-WRFDYV 

251 300 

SEQIDN0:8I KGYGAWWKD WLNWWG6WAV GBYWDTNVDA LLNWAYSSGA KVFDFPLYYK 

pyro KGYGAWWKD WLNWWGGWAV GEYWDTNVDA LLNWAYSSGA KVFDFPLYYK 

pyro2 KGYGAWWRD WLNWWGGWAV GEYWDTNVDA LLSWAYESGA KVFDFPLYYK 

thermo KGYGAWWKD WLKWW.ALAV GEYWDTNVDA LLNWAYSSGA KVFDFPLYYK 

thermo2 KGYAPWWKN WLNRWGGWAV GEYWDTNVDA LLSWAYDSGA KVFDFPLYYK 

Consensus KGY--WW-- WL--W AV GEYWDTNVDA LL-WAY-SGA KVFDFPLYYK 

301 35 0 

SEQIDN0:81 MDEAFDNKNI PALVSALQNG QTWSRD PFK AVTFVANHDT DIIWNKYLAY 

pyro MDEAFDNTNI PALVDALQNG GTWSRD PFK AVTFVANHDT DIIWNKYPAY 

pyro2 MDEAFDNNNI PALVYALQNG QTWSRDPFK AVTFVANHDT DIIWNKYPAY 

thermo MDEAFDNKNI PALVSALQNG QTWSRDPFK AVTFVANHDT DIIWNKYPAY 

thermo2 MDEAFDNNNI PALVDALKNG GTWSRD PFK AVTFVANHDT NIIWNKYPAY 

Consensus MDEAFDN-NI PALV-AL-NG -TWSRDPFK AVTFVANHDT -IIWNKY-AY 
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Figure 14a (cont) 





351 


SEQIDN0:8t 


AFILTYEGQP 


pyro 


AFILTYBGOP 


pyro2 


AFILTYEGOP 


thermo 


AFILTYEGQP 


thenno2 


AFILTYEGQP 


Consensus 


AFILTYEGQP 




401 


SEQIDNOtSl 


MIFVRNGYGS 


pyro 


LIFVRNGDSK 


pyro2 


LIFVRNGDSR 


thermo 


LIFVRNGYGN 


the nno2 


LIFVRN6YGD 


Consensus 


-IFVRNG--- 




451 


SEQIDN0:8I 


DKYVY SSGWV 


pyro 


DKYVE SSGWV 


pyro2 


DKRVD SSGWV 


thermo 


DKYVGSNGWV 


thermo2 


DKWVD SSGRV 


Consensus 


DK-V- S-G-V 



VIFYRDYEEW 
VIFYRDYEEW 
VIFYRDFEEW 
VIFYRDYEEW 
AIFYRDYEEW 
-IFYRD-EEW 



KPGLITYINL 
RPGLITYINL 
RPGHTYINL 
KPGLITYINL 
KPGLITYINL 
-PGLITYINL 



LNKDRLNNLI 
LNKDKUINLI 
LNKDKLINLI 
LNKDRLKNLI 
LNKDRLRNLI 
LNKD-L- NLI 



GSSKVGRWVY 
6SSKVGRWVY 
SPNWVGRWVY 
GSSKVGRWVY 
GSSKAGRWVY 
GRWVY 



WIHDHLAGGS 
WIHDHLAGGS 
WIHDHLAGGS 
WIHNNLAG GS 
WIHDHLAGGS 
WIH--LAGGS 



VPKFAGACIH 
VPKFAGACIH 
VPKFAGACIH 
VPKFAGSCIH 
VPKFAGSCIH 
VPKFAG-CIH 



400 

TSIVYYDSDE 
TSIVYYDSDE 
TTIVYYDNDB 
TSIVYYDNDE 
TDIIYYDSDE 
T-I-YYD-DE 

450 

EYTGNLGGWV 
EYTGNLGGWV 
EYTGNLGGWV 
EYTGNLGGWV 
EYTGNLGGWI 
KYTGNLGGW- 



YFEAPAYDPA 
YLEAPAYDPA 
YLEAPPHDPA 
YLEAPAHDPA 
YLEAPAHDPA 
Y-EAP- -DPA 



466 

NGQYGYSVWS YCGVG* 
SGQYGYTVWS YCGVG* 
NGYYGYSVWS YCGVG* 
KGQYGYSVWS YCGVG* 
NGQYGYSVWS YCGVG* 
-G-YGY-VWS YCGVG* 
Anti sense priaier 
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Figure 14b: 

^ .50 

SEQIDN0:81 MKKFVA LFITOFFVVS MAWAQPASA 

pyro MKKFVA LLITMFFVVS MAAVAQPASA 

SEQ1DN0:73 

« thermo2 MA RKVLVALiVF LWLSVSAVP 

SEQ ID NO: 75 

SEQIDNO:?? « IIIIIIIII 

SEQ ID NO: 83 

SEQ ID NO: 85 

SEQ ID NO: 79 MKPAKL LVFVLWS IL AGLYAQPAGA 

thermo SESQCTATCT WRWYMSAKK LLALLFVLAV LVGVAVIPAR VGIAPVSAGA 

pyro2 VNIKK LTPLLTLLLF FIVLASPVSA 

CLONE A MRRSARV LVLIIAFFLL AGIYYPSTSA 

Consensus * 

51 100 

SEQIDNO: 81 AKYSELEEGG VIMQAFYWDV PGGGIWWDTI RSKIPEWYEA GISAIWIPPA 

pyro AKYSELEEGG VIMQAFYWDV PAGGIWWDTI RSKIPEWYEA GISAIWIPPA 

SEQ ID NO: 73 MALEEGG LIMQAFYWDV PGGGIWWDTI AQKIPDWASA GISAIWIPPA 

thermo2 AKAETLENGG VIMQAFYWDV PGGGIWWDTI AQKIPDWASA GISAIWIPPA 

SEQIDNO: 75 MALEEGG LIMQAFYWDV PMGGIWWDTI AQKIPDWASA GISAIWIPPA 

SEQ1DN0:77 —-MALEEGG LIMQAFYWDV PMGGIWWDTI AQKIPDWASA GISAIWIPPA 

SEQ ID NO: 83 MALEEGG LIMQAFYWDV PGGGIWWDTI AQKIPEWASA GISAIWIPPA 

SEQIDNO: 85 MALEEGG LIMQAFYWDV PGGGIWWDTI AQKIPBWASA GISAIWIPPA 

SEQIDNO:79 AKYLELEEGG VIMQAFYWDV PSGGIWWDTI RQKIPEWYDA GISAIWIPPA 

thermo TSRPSLEEGG VIMQAFYWDV PAGGIWWDTI RSKIPDWASA GISAIWIPPA 
pyro2 , AKYLELEEGG VIMQAFYWDV PGGGIWWDHI RSKIPEWYEA GISAIWLPPP 

CLONEA AKYSELEQGG VIMQAFYWDV PEGGIWWDTI RQKIPEWYDA GISAIWIPPA 

Consensus GG -IMQAFYWDV P-GGIWWD-I --KIP-W- -A GISAIW-PP- 

101 15 0 

SEQIDNO: 81 SKGMSGGYSM GYDPYDFFDL GEYNQKGTIE TRFGSKQELI NMINTAHAYG 

pyiro SKGMGGAYSM GYDPYDFFDL GEYNQKGTVE TRFGSKQELI NMINTAHAYG 

SEQ ID NO: 73 SKGMSGGYSM GYDPYDFFDL GEYYQKGSVE TRFGSKEELV NMINTAHAHN 

thermo2 SKGMSGGYSM GYDPYDFFDL GEYYQKGSVE TRFGSKEELV NMINTAHAHN 

SEQ ID NO: 75 SKGMSGGYSM GYDPYDYFDL GEYYQKGTVE TRFGSKQELI NMINTAHAYG 

SEQIDNO: 77 SKGMSGGYSM GYDPYDYFDL GEYYQKGTVE TRFGSKQELI NMINTAHAYG 

SEQ ID NO: 83 SKGMSGGYSM GYDPYDFFDL GEYYQKGTVE TRFGSKEELV NMINTAHSYG 

SEQ ID NO: 85 SKGMSGGYSM GYDPYDFFDL GEYYQKGTVE TRFGSKEELV NMINTAHSYG 

SEQ ID NO: 79 SKGMGGAYSM GYDPYDFFDL GEYDQKGTVE TRFGSKQELV NMINTAHAYG 

thermo SKGMSGAYSM GYDPYDFFDL GEYYQKGTVE TRFGSKQELI NMINTAHSYG 

pyro2 SKGMSGGYSM GYDPYDYFDL GEYYQKGTVE TRFGSKEELV RLIQTAHAYG 

CLONEA SKGMGGAYSM GYDPYDYFDL GEFYQKGTVE TRFGSKEELV NMISTAHQYG 

Consensus SKGM-G-YSM GYDPYD-FDL GE--QKG--E TRFGSK-EL- «-I-TAH 

151 200 

SEQ ID NO: 81 IKVIADIVIN HRAGGDLEWN PFVGDYTWTD FSKVASGKYT ANYLDFHPNE 

pyro IKVIADIVIN HRAGGDLEWN PFVGDYTWTD FSKVASGKYT ANYLDFHPNE 

SEQ ID NO: 73 MKVIADIVIN HRAGGDLEWN PFTNSYTWTD FSKVASGKYT ANYLDFHPNE 

Chermo2 MKVIADIVIN HRAGGDLEWN PFTNSYTWTD FSKVASGKYT ANYLDFHPNE 

SEQ ID NO: 75 MKVIADIVIN HRAGGDLEWN PFVNDYTWTD FSKVASGKYT ANYLDFHPNE 

SEQ ID NO: 77 MKVIADIVIN HRAGGDLEWN PFVNDYTWTD FSKVASGKYT ANYLDFHPNE 

SEQIDNO:83 IKVIADIVIN HRAGGDLEWN PFVNDYTWTD FSKVASGKYT ANYLDFHPNE 

SEQ ID NO: 85 IKVIADIVIN HRAGGGLEWN PFVNDYTWTD FSKVASGKYT ANYLDFHPNE 

SEQ ID NO: 79 IKVIADIVIN HRAGGDLEWN PFVNDYTWTD FSKVASGKYT ANYLDFHPNE 

thermo IKVIADIVIN HRAGGDLEWN PFTNSYTWTD FSKVASGKYT ANYLDFHPNE 

pyro2 IKVIADWIN HRAGGDLEWN PFVGDYTWTD FSKVASGKYT ANYLDFHPNE 

CLONEA IKVIADIVIN HRAGGDLEWN PYVGDYTWTD FSKVASGKYK AHYMDFHPNN 

Consensus -KVIAD-VIN HRAGG-LEWN P YTWTD FSKVASGKY- A-Y-DFHPN- 
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Figure 14b (com.) 



SEQIDN0:8I 
pyro 
SEQIDNO: 73 
the rino2 
SEQIDNO: 75 
SEQIDNO: 77 
SEQIDNO: 83 
SEQ ID NO: 85 
SEQ ID NO: 79 
thermo 
pyro2 
CLONE A 
Cons ensus 



201 

VKCCDEGTFG 
VKCCDEGTFG 
LHAGDSGTFG 
LHAGDSGTFG 
LHAGDSGTFG 
LHAGDSGTFG 
LHCCDEGTFG 
LHCCDEGTFG 
VKCCDEGTFG 
VKCCDEGTFG 
LHCCDEGTFG 
YSTSDEGTFG 
D-GTFG 



GFPDIAHEKS 
GFPDIAHEKE 
GYPDICHDKS 
GYPDICHDKS 
GYPDICHDKS 
GYPDICHDKS 
GYPDICHDKS 
GYPDICHDKS 
GFPDIAHEKS 
GFPDIAHEKS 
GFPDICHHKE 
GFPDIDHLVP 
G-PDI-H--- 



WDQHWLHASD 
HDQHWLHASD 
HDQHWLWASN 
WDQHWLWASN 
WDQYWLWASQ 
WDQYWLWASQ 
VnDQYWLWASS 
WDQYWLWASS 
WDQYWLWASN 
WDQYWLWASQ 
WDQYWLWKSN 
FNQYWLWASN 
--Q-WLW-S- 



ESYAAYLRSI 
ESYAAYLRSI 
ESYAAYLRSI 
ESYAAYLRSI 
ESYAAYLRSI 
ESYAAYLRSI 
ESYAAYLRSI 
ESYAAYLRSI 
ESYAAYLRSI 
KSYAAYLRSI 
ESYAAYLRSI 
ESYAAYLRSI 
-SYAAYLRSI 



25 0 

GVDAWRFDYV 
GVDAWRFDYV 
GIDAWRFDYV 
GIDAWRFDYV 
GIDAWRFDYV 
GIDAWRFDYV 
GVDAWRFDYV 
GVDAWCFDYV 
GVDAWRFDYV 
GIDAWRFDYV 
GFDGWRFDYV 
GIDAWRFDYV 
G-D-W-FDYV 



SEQ ID NO: 81 
pyro 
SEQIDNO: 73 

the rmo2 
SEQ ID NO: 75 

SEQ ID NO: 77 
SEQ ID NO: 83 
SEQ ID NO: 85 
SEQ ID NO: 79 
thermo 
pyro2 
CLONE A 
Cons ensus 



251 

KGYGAWWKD 
KGYGAWWKD 
KGYAPWWKN 
KGYAPWWKN 
KGYAPWWRD 
KGYAPWWKD 
KGYGAWWND 
KGYGAWWND 
KGYGAWWKD 
KGYGAWWKD 
KGYGAWWRD 
KGYGAWWKD 
KGY--WW-- 



WLNWWGGWAV 
WLNWWGGWAV 
WLNRWGGWAV 
WLNRWGGWAV 
WLNWWGGWAV 
WLNWWGGWAV 
WLSWWGGWAV 
WLSWWGGWAV 
WLDWWGGWAV 
WLKWW. ALAV 
WLNWWGGWAV 
WLSQWGGWAV 
WL--W AV 



GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 
GEYWDTNVDA 



LLNWAYSSGA 
LLNWAYSSGA 
LLSWAYDSGA 
LLSWAYDSGA 
VLNWAYSSGA 
VLNMAYSSGA 
LLNWAYSSGA 
LLNWAYNSGA 
LLNWAYSSDA 
LLNWAYSSGA 
LLSWAYESGA 
LLNWAYSSGA 
-L-WAY-S-A 



300 

KVFDFPLYY K 
KVFDFPLYYK 
KVFDFPLYY K 
KVFDFPLYYK 
KVFDFALYY K 
KVFDFALYYK 
KVFDFPLYYK 
KVFDFPLYYK 
KVFDFPLYYK 
KVFDFPLYYK 
KVFDFPLYYK 
KVFDFPLYYK 
KVFDF-LYYK 



SEQ ID NO: 81 
pyro 
SEQ ID NO: 73 
the rmo2 
SEQIDNO: 75 

SEQIDNO: 77 
SEQIDNO: 83 
SEQIDNO: 85 

SEQIDNO: 79 
thermo 
pyro2 
CLONE A 
Cons ensus 



301 

MDEAFDNKNI 
MDEAFDNTNI 
MDEAFDNNNI 
MDEAFDNNNI 
MDEAFDNNNI 
MDEAFDNNNI 
MDEAFDNTNI 
MDEAFDNTNI 
MDAAFDNKNI 
MDEAFDNKNI 
MDEAFDNNNI 
MDEAFDNKNI 
MD-AFDN-NI 



PALVSALQNG 
PALVDALQNG 
PTILVDALKNG 
PALVDALKNG 
PALVDALRYG 
PALVDALRYG 
PALVDALRYG 
PALVYALKNG 
PALVEALKNG 
PALVSALQNG 
PALVYALQNG 
PALVYA IQNG 
PALV-A---G 



QTWSRD PFK 
GTWSRD PFK 
GTWSRDPFK 
GTWSRD PFK 
QTWSRD PFK 
QTWSRD PFK 
QTWSRD PFK 
GTWSRD PFK 
GTWSRDPFK 
QTWSRD PFK 
QTWSRD PFK 
ETWSRD PFK 
-TWSRDPFK 



AVTFVANH DT 
AVTFVANHDT 
AVTFVANH DT 
AVTFVANHDT 
AVTFVANHDT 
AVTFVANHDT 
AVTFVANHDT 
AVTFVANHDT 
AVTFVANHDT 
AVTFVANHDT 
AVTFVANH DT 
AVTFVANHDT 
AVTFVANHDT 



35 0 

DIIWNKYLAY 
DIIWNKYPAY 
NIIWNKYPAY 
NIIWNKYPAY 
DIIWNKYPAY 
DIIWNKYPAY 
DIIWNKYPAY 
DIIWNKYPAY 
DIIWNKYPAY 
DIIWNKYPAY 
DIIWNKYPAY 
NIIWNKYPAY 
-IIWNKY-AY 



SEQ ID NO: 81 
pyro 
SEQIDNO: 73 
thermo2 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID NO: 83 
SEQIDNO: 85 
SEQ ID NO: 79 
thermo 
pyro2 
CLONE A 
Consensus 



351 

AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 
AFILTYEGQP 



VIFYRDYEEW 
VIFYRDYEEW 
AIFYRDYEEW 
AIFYRDYEEW 
TIFYRDYEEW 
TIFYRDYEEW 
VIFYRDYEEW 
VIFYRDYEEW 
TIFYRDYEEW 
VIFYRDYEEW 
VIFYRD FEEW 
VIFYRDYEEW 
-IFYRD-EEW 



LNKDRLNNLI 
LNKDKLNNLI 
LNKDRLRNLI 
LNKDRLRNLI 
LNKDKLKNLI 
LNKDKLKNLI 
LWKDKLNNLI 
LNKDKLNNLI 
LNKDRLKNLI 
LNKDRLKNLI 
LNKDKLI NLI 
LNKDKLNNLI 
LNKD-L- NLI 



WIHDHLAGGS 
WIHDHLAGGS 
WIHDHLAGGS 
WIHDHLAGGS 
WIHDNLAGGS 
WIHDNLAGGS 
WIHDHLAGGS 
WIHDHLAGGS 
WIHDHLAGGS 
WIHNNLAGGS 
WIHDHLAGGS 
WIHEHLAG GS 
WIH--LAGGS 



400 

TSIVYYDSDB 
TSIVYYDSDB 
TDIIYYDSDE 
TDIIYYDSDE 
TDIVYYDNDB 
TDIVYYDNDE 
TDIVYYDSD E 
TDIVYYDSDE 
TDIVYYDNDE 
TSIVYYDNDE 
TTIVYYDNDE 
TKILYYDDDE 
T-I-YYD-DE 



\ 



wo 02/068589 

u re 14b (cont.) 



SEQIDNO: 8t 
pyro 
SEQFDNO: 73 
the nno2 
SEQIDNO: 75 
SEQIDNO: 77 
SEQ ID NO: 83 
SEQIDNO: 85 
SEQ ID NO: 79 
thermo 
pyro2 
CLONE A 
Cons ensus 



SEQ ID NO: 81 
pyro 
SEQ ID NO: 73 
thermo2 
SEQIDNO: 75 

SEQ ID NO: 77 
SEQIDNO: 83 
SEQIDNO: 85 

SEQIDNO: 79 
th ermo 
pyro2 
CLONE A 
Cons ensus 



401 

MIFVRNGYGS 
LIFVRNGDSK 
LIFVRNGYGD 
LIFVRNGYGD 
LIFVRNGYGS 
LIFVRNGYGS 
LIFVRNGYGT 
LIFVRNGYGT 
LIFVRNGYGD 
LIFVRNGYGN 
LIFVRNGDSR 
LIFMREGYGD 
-IF-R-G--- 

451 

VDKYVYSSGW 
VDKYVESSGW 
IDKWVDSSGR 
IDKWVDSSGR 
VDKWVDSSGW 
VDKWVDSSGW 
IDKYVSSSGW 
IDKYVSSSGW 
VDKWVDSSGW 
VDKYVGSNGW 
VDKRVDSSGW 
VDRYVQYDGW 
-D--V---G- 



KPGLITYINL 
RPGLITYINL 
KPGLITYINL 
KPGLITYINL 
KPGLITYINL 
KPGLITYINL 
KPGLITYINL 
KPGLITYINL 
KPGLITYINL 
KPGLITYINL 
RPGLITYINL 
RPGLITYINL 
-PGLITYINL 



VYFEAPAYDP 
VYLEAPAYDP 
VYLEAPAHDP 
VYLEAPAHDP 
VYLEAPAHDP 
VYLEAPAHDP 
VYLEAPAHDP 
VYLEAPAHDP 
VYLEAPAHDP 
VYLEAPAHDP 
VYLEAPPHDP 
VKLTAPPHDP 
V---AP--DP 
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GSSKVGRWVY 
GSSKVGRWVY 
GSSKAGRWVY 
GSSKAGRWVY 
GSSKAGRWVY 
ASSKAGRWVY 
GSSKVGRWVY 
GSSKAGRWVY 
GSSKAGRWVY 
GSSKVGRWVY 
SPNWVGRWVY 
GSDWAERWVN 
RWV- 



ANGQYGYSVW 
ASGQYGYTVW 
ANGQYGYSVW 
ANGQYGYSVW 
ANGQYGYSVW 
ANGQYGYSVW 
ANGYYGY SVW 
ANGQYGY SVW 
ANGYYGY SVW 
AKGQYGYSVW 
ANGYYGY SVW 
ANGYYGY SVW 
A-G-YGY -VW 



PCT/US02/05068 



V.PKFAGACI 
V.PKFAGACI 
V.PKFAGSCI 
V.PKFAGSCI 
V.PKFAGSCI 
V.PKFAGSCI 
V.PKFAGSCI 
V.PKFAGSCI 
V.PKFAGACI 
V.PKFAGSCI 
V.PKFAGACI 
VGSKFAGYTI 
V--KFAG- -I 

487 
SYCGVG* 

SYCGVG* 
SYCGVG* 
SYCGVG* 
SYCGVG* 
SYCGVG* 
SYCGVG* 
SYCGVG* 
SYCGVG* 
SYCGVG* 
SYCGVG* 
SYAGVG* 
SY-GVG* 



450 

HBYTGNLGGW 
HEYTGNLGGW 
HBYTGNLGGW 
HEYTGNLGGW 
HEYTGNLGGW 
HEYTGNLGGW 
HEYTGNLGGW 
HEYTGSLGGW 
HEYTGNLGGW 
HEYTGNLGGW 
HEYTGNLGGW 
HEYTGNLGGW 
HEYTG-LGGW 
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Figure 14c: 



1 50 

SEQ ID NO: 83 — ^ 

SEQ ID NO: 85 

SEQ ID NO: 75 

SEQ ID NO: 77 

SEQ ID NO: 73 : 

SEQ ID NO: 79 ---ATGAAGC CTGCGAAACT CCTCGTCTTT GTGCTCGTAG TCTCTATCCT 

SEQ ID NO: 81 ATGAAGA AGTTTGTCGC CCTGTTCATA ACCATGTTTT TCGTAGTGAG 

CLONE A ATGAGGAGAT CCGCAAGGGT TTTGGTTCT6 ATTATAGCGT TTTTCXrrCCT 

Consensus — * ^ — * — 

51 100 

SEQIDNO:83 ATGGCTCTGG 

SEQ ID NO: 85 ATGGCTCTGG 

SEQIDNO:75 ,^ ATGGCTCTGG 

SEQ ID NO: 77 ATGGCTCTGG 

SEQIDNO:73 , ATGGCTCTGG 

SEQ ID NO: 79 CGCGGGGCTC TACGCCCAGC CCGCGGGGGC GGCCAAGTAC CTGGAGCTCG 

SEQ ID NO: 81 CATGGCAGTC GTTGCACAGC CAGCTAGCGC CGCAAAGTAT TCCGAGCTCG 

CLONE A GGCGGGGATT TACTACCCCT CCACGAGTGC CGCGAAGTAC TCCGAGCTGG 

Consensus 



SEQ ID NO: 83 
SEQ ID NO: 85 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consen sus 



101 

AA6AGGGCGG 
AAGAGGGC6G 
AAGAGGGCGG 
AAGAGGGCGG 
TAGAGGGCGG 
AAGAGGGCGG 
AAGAAGGCGG 
AGCAGGGCGG 
GGCGG 



GCTCATAATG 
GCTTATAATG 
GCTTATAATG 
GCTCATAATG 
GCTTATAATG 
CGTCATAATG 
CGTTATAATG 
AGTCATAATG 
--T-ATAATG 



CAGGCCTTCT 
CAGGCATTCT 
CAGGCATTCT 
CAGGCCTTCT 
CAGGCCTTCT 
CAGGCGTTCT 
CAGGCCTTCT 
CAGGCCTTCT 
CAGGC-TTCT 



ACTGGGATGT 
ATTGGGACGT 
ACTGGGAC6T 
ACTGGGACGT 
ACTGGGACGT 
ACTGGGACGT 
ACTGGGACGT 
ACTGGGACGT 
A-TGGGA-GT 



150 

TCCTGGAGGA 
CCCAGGTGGA 
CCCCATGGGA 
CCCCATGGGA 
CCCAGGTGGA 
GCCTTCAGGA 
CCCAGGTGGA 
TCCGGAGGGA 
-CC GGA 



SEQ ID Na 83 
SEQ ID NO: 85 
SEQ ID NO: 75 

SEQ ID NO: 77 
SEQ ID Na 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consensus 



151 

GGAATCTGGT 
GGAATCTGGT 
GGAATCTGGT 
GGAATCTGGT 
GGAATCTGGT 
GGAATATGGT 
GGAATCTGGT 
GGAATCTGGT 
GGAAT-TGGT 



GGGACACAAT 
GGGACACCAT 
GGGACACGAT 
GGGACACGAT 
GGGACACCAT 
GGGACACAAT 
GGGACACCAT 
GGGACACAAT 
GGGACAC-AT 



AGCTCAAAAG 
AGCCCAGAAG 
AGCCCAGAAG 
AGCCCAGAAG 
AGCCCAGAAG 
ACGGCAGAAG 
CAGGAGCAAG 
ACGGCAGAAG 
AAG 



ATACCCGAAT 
ATACCCGAAT 
ATACCCGACT 
ATACCCGACT 
ATACCCGACT 
ATACCGGAGT 
ATACCGGAGT 
ATCCCTGAAT 
AT-CC-GA-T 



200 

GGGCAAGTGC 
GGGCAAGTGC 
GGGCAAGCGC 
GGGCAAGCGC 
GGGCGAGCGC 
GGTACGATGC 
GGTACGAGGC 
GGTACGATGC 
GG GC 



SEQ ID NO: 83 
SEQ ID NO: 8S 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID Na 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consensus 



201 

AGGAATCTCA 
AGGAATCTCA 
CGGGATTTCG 
CGGGATTTCG 
CGGGATTTCG 
CGGAATCTCC 
GGGAATATCC 
AGGCATATCC 
-GG-AT-TC- 



GCGATATGGA 
GCGATATG6A 
GCGATATGGA 
GCGATATGGA 
GCAATATGGA 
GCAATATGGA 
GCCATTTGGA 
GCCATCTGGA 
GC-AT-TGGA 



TTCCACCAGC 
TTCCACCAGC 
TTCCCCCCGC 
TCCCTCCCGC 
TTCCTCCCGC 
TTCCCCCGGC 
TTCCGCCAGC 
TACCCCCGGC 
T-CC-CC-GC 



GAGTAAGGGC 
GAGTAAGGGA 
GAGCAAGGGT 
GAGCAAGGGT 
GAGTAAGGGC 
GAGCAAGGGC 
CAGCAAGGGG 
GAGCAAGGGC 
-AG-AAGGG- 



250 

ATGAGCGGTG 
ATGAGCGGTG 
ATGAGCGGCG 
ATGAGCGGCG 
ATGAGCGGCG 
ATGGGCGGCG 
ATGAGCGGCG 
ATGGGCGGGG 
ATG^GCGG-G 
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Figure 14c (cont.) 

251 

SEQIDNO: 83 
SEQIDNO: 85 
SEQIDNaTS 
SEQIDNO: 77 

SEQIDNO: 73 
SEQ ID NO: 79 
SEQIDNO: 81 
CLONEA 

Consen sus 



GTTATTCCAT 
GTTATTCCAT 
GCTATTC GAT 
GCTATTCGAT 
GCTATTC GAT 
CCTATTC GAT 
GTTACTC GAT 
CCTACTCGAT 
--TA-TC-AT 



GGGCTACGAT 
GGGCTACGAT 
GGGCTACGAC 
GGGCTACGAC 
GGGCTACGAC 
GGGCTACGAC 
GGGCTACGAT 
GGGCTACGAC 
GGGCTACGA- 



CCCTACGATT 
CCCTACGATT 
CCCTACGATT 
CCCTACGATT 

CCCTACGATT 
CCCTACGACT 
CCCTACGATT 
CCCTACGATT 
CCCTACGA- T 



TCTTTGACCT 
TCTTTGACCT 
ATTTTGACCT 
ATTTTGACCT 
TCTTCGACCT 
TCTTTGACCT 
TCTTTGACCT 
ACTTCGATCT 
--TT-GA-CT 



300 

CGGCGAGTAC 
CGGCGAGTAC 
CGGT6AGTAC 
CGGTGAGTAC 
CGGTGAGTAC 
CGGTGAGTAC 
CGGCGAGTAC 
GGGCGAGTTT 
-GG-GAGT-- 



SEQIDNO:83 
SEQ ID NO. 85 
SEQIDNO: 75 
SEQ ID NO: 77 
SEQIDNO: 73 
SEQIDNO: 79 
SEQ ID NO: 81 
CLONEA 

Consen sus 



301 

TATCAGAAGG 
TATCAGAAGG 
TACCAGAAGG 
TACCAGAAGG 
TACCAGAAGG 
GACCAGAAGG 
AACCAGAAGG 
TACCAGAAGG 
-A-CAGAAGG 



GGACAGTTGA 
GGACAGTTGA 
6AACGGTGGA 
GAACGGTGGA 
GAAGCGTTGA 
GAACGGTAGA 
GAACCATCGA 
GAACCGTTGA 
G-A---T-GA 



GACGCGCTTC 
GACGCGCTTC 
AACAAGATTC 
AACGAGGTTC 
GACGCGCTTC 
GACGCGCTTT 
AACGCGCTTT 
GACGCGCTTC 
-AC--G-TT- 



GGCTCAAAGG 
GGCTCAAAGG 
GGCTCAAAGC 
GGCTCAAAGC 
GGATCAAAAG 
GGCTCCAAGC 
GGCTCTAAAC 
GGCTCCAAGG 
GG-TC-AA-- 



350 

AAGAACTGGT 
AAGAACTGGT 
AGGAGCTCAT 
AGGAGCTCAT 
AGGAGCTTGT 
AGGAGCTCGT 
AGGAGCTCAT 
AAGAGCTCGT 
A-GA-CT--T 



SEQ ID NO: 83 
SEQ ID Na 85 
SEQ ID NO: 75 
SEQ ID NO: 77 

SEQ ID NO: 73 
SEQ ID NO: 79 
SEQIDNO: 81 
CLONEA 

Consen sus 



351 

GAACATGATA 
GAACATGATA 
AAACATGATA 
AAACATGATA 
GAACATGATA 
GAACATGATA 
CAATATGATA 
CAACATGATC 
-AA-ATGAT- 



AACACCGCAC 
AACACCGCAC 
AACACCGCCC 
AACACCGCCC 
AACACCGCCC 
AACACCGCCC 
AACACCGCCC 
TCCACGGCCC 
--CAC-GC-C 



ACTCCTACGG 
ACTCCTACGG 
ACGCCTATGG 
ACGCCTATGG 
ATGCTCACAA 
ACGCCTACGG 
ATGCCTACGG 
ACCAGTACGG 
A A 



CATAAAGGT6 
CATAAAGGTG 
CATGAAGGTA 
CATGAAGGTA 
CATGAAGGTC 
CATCAAGGTC 
CATAAAGGTC 
CATCAAGGTT 
CAT-AAGGT- 



400 

ATAGCAGACA 
ATAGCGGACA 
ATAGCCGATA 
ATAGCCGATA 

ATAGCGGACA 
ATCGCAGACA 
ATAGCGGACA 
ATAGCGGACA 
AT-GC-GA-A 



SEQIDNO: 83 
SEQIDNO: 85 
SEQIDNO: 75 
SEQ ID NO: 77 
SEQ ID Na 73 
SEQIDNO: 79 
SEQ1DN0:_81 
CLONEA 

Consen sus 



401 

TAGTCATAAA 
TAGTCATAAA 
TAGTCATCAA 
TAGTCATCAA 
TAGTCATCAA 
TAGTAATCAA 
TCGTCATAAA 
TAGTCATAAA 
T-GT-AT -AA 



CCACCGCGCC 
CCACCGCGCC 
CCACCGCGCC 
CCACCGCGCC 
CCACCGCGCC 
CCACCGCGCC 
CCACCGCGCA 
CCACCGCGCA 
CCACCGCGC- 



GGTGGAGACC 
GGTGGAGGCC 
GGCGGCGATC 
GGCGGTGACC 
GGCGGCGACC 
GGAGGAGACC 
GGCGGAGACC 
GGTGGAGACC 
GG-GG-G--C 



TTGAGTGGAA 
TCGAdTGGAA 
TGGAGTGGAA 
TGGAGTGGAA 
TGGAGTGGAA 
TTGAGTGGAA 
TCGAGTGGAA 
TCGAATGGAA 
T-GA-TGGAA 



450 

CCCCTTCGTG 
CCCCTTCGTG 
CCCCTTCGTG 
CCCCTTCGTG 
TCCTTTCACC 
CCCCTTCGTC 
CCCGTTCGTT 
CCCATACGTC 
-CC-T-C--- 



SEQ ID NO: 83 
SEQIDNO: 85 
SEQ ID NO: 75 
SEQ ID Na 77 
SEQ ID Na 73 
SEQ ID NO: 79 
SEQ ID Nasi 
CLONEA 

Consen sus 



451 

AACGACTATA 
AACGACTATA 
AACGACTATA 
AACGACTATA 
'AACAGCTACA 
AATGACTACA 
GGGGACTACA 
GGCGACTATA 
CTA-A 



CCTGGACAGA 
CCTGGACAGA 
CCTGGACCGA 
CCTGGACCGA 
CCTGGACCGA 
CCTGGACGGA 
CCTGGACGGA 
CCTGGACGGA 
CCTGGAC-GA 



CTTCTCAAAA 
CTTCTCAAAA 
CTTCTCGAAG 
CTTCTCAAAG 
TTTCTCGAAG 
CTTCTCGAAG 
CTTCTCAAAG 
CTTTTCTAAG 
-TT-TC-AA- 



GTCGCCTCCG 
GTCGCCTCCG 
GTCGCGTCGG 
GTCGCGTCGG 
GtCGCGTCGG 
GTCGCTTCCG 
GTGGCCTCGG 
GTCGCCTCCG 
GT-GC-TC-G 



500 

GTAAATATAC 
GTAAATATAC 
GTAAATACAC 
GTAAATACAC 
GCAAGTACAC 
GCAAGTACAC 
GCAAATATAC 
GGAAATACAA 
G-AA-TA-A- 
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Figure 14c (cont.) 

501 

SEQIDNO:83 
SEQ ID NO: 8S 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID NO: 79 



SEQ ID NO: 81 
CLONEA 

Consensus 



GGCCAACTAC 
AGCCAACTAC 
GGCCAACTAC 
GGCCAACTAC 
GGCCAACTAC 
GGCCAACTAC 
TGCCAACTAC 
GGCCCACTAC 
-GCC-ACTAC 



CTTGACTTCC 
CTTGACTTCC 
CTCGACTTCC 
CTCGACTT CC 
CTCGACTTCC 
CTCGACTTCC 
CTCGACTT CC 
ATGGACTTCC 
-T-GACTTCC 



ACCCAAACGA 
ACCCAAACGA 
ACCCGAACGA 
ACCCGAACGA 
ACCCGAACGA 
ACCCCAACGA 
ACCCGAACGA 
ATCCAAACAA 
A-CC-AAC- A 



GCTTCACTGT 
GCTTCACTGT 
GCTCCACGCG 
GCTCCATGCG 
GCTTCACGCG 
GGTCAAGTGC 
GGTCAAGTGC 
CTACAGCACC 



550 

TGTGATGAAG 
TGTGATGAAG 
GGCGATTCCG 
GGCGATTCCG 
GGCGATTCCG 
TGCGACGAGG 
TGTGACGAGG 
TCAGACGAGG 
---GA G 



SEQ ID NO: 83 
SEQ ID NO: 85 
SEQ ID NO: 75 
SEQ ID Na 77 
SEQ ID Na 73 
SEQ ID Na 79 

SEQ ID Na 81 
CLONEA 

Consensus 



551 

GTACCTTTGG 
GTACCTTTGG 
GAACATTTGG 
GAACATTTGG 
GAACATTTGG 
GCACCTTTGG 
GCACATTTGG 
GAACCTTCGG 
G-AC-TT-GG 



AGGATACCCT 
AQGATACCCT 
AGGCTATCCC 
AGGCTATCCC 
AGGCTATCCC 
AGGGTTCCCG 
AGGCTTCC CA 
TGGCTTCCCA 
-GG-T--CC-' 



GATATATGTC 
GATATATGTC 
GACATATGCC 
GACATATGCC 
GACATATGCC 
GACATAGCCC 
GACATAGCCC 
GACATTGATC 
GA-AT C 



ACGACAAAAG 
ACGACAAAAG 
ACGACAAGAG 
ACGACAAGAG 
ACGACAAGAG 
ACGAGAAGAG 
ACGACAAGAG 
ACCTCGTGCC 
AC- 



600 

CTGGGACCAG 
CTGGGACCAG 
CTGGGACCAG 
CTGGGACCAG 
CTGGGACCAG 
CTGGGACCAG 
CTGGGACCAG 
CTTCAACCAG 
CT— -ACCAG 



SEQ ID Na 83 
SEQ ID Na 85 
SEQ ID Na 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID Na 79 
SEQ ID Na 81 
CLONEA 
Consensus 



601 

TACTGGC TCT 
TACTGGCTCT 
TACTGGC TCT 
TACTGGCTCT 
CACTGGCTCT 
TACTGGCTCT 
CACTGGCTCT 
TACTGGC TGT 
-ACTGGCT-T 



GGGCGAGCAG 
GGGCGAGCAG 
GGGCCAGCCA 
GGGCCA6CCA 
GGGCCAGCAA 
GGGCGA6CM 
GGGCGAGCGA 
GGGCGAGCAA 
GGGC-AGC-- 



CGAAAGCTAC 
CGAAAGCTAC 
GGAGAGCTAC 
GGAGAGCTAC 
CGAAAGCTAC 
GGAGAGCTAC 
TGAGAGCTAC 
CGAGAGCTAC 
-GA-AGCTAC 



GCTGCCTACC 
GCTGCCTACC 
GCGGCCTATC 
GCGGCATATC 
GCCGCCTACC 
GCCGCCTACC 
GCCGCCTACC 
GCCGCCTACC 
GC-GC-TA-C 



650 

TCAGGAGCAT 
TCAGGAGCAT 
TCAGGAGCAT 
TCAGGAGCAT 
TCCGGAGCAT 
TCAGGAGCAT 
TAAGGAGCAT 
TCAGGAGCAT 
T--GGAGCAT 



SEQ ID NO: 83 
SEQ ID NO: 85 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID NO: 79 
SEQ ID Na 81 
CLONEA 

Consen sus 



651 

AGGGGTTGAC 
AGGGGTT6AC 
CGGCATCGAC 
CGGCATCGAT 
CGGCATCGAC 
CGGCGTTGAC 
CGGCGTTGAT 
AGGGATCGAT 
-GG--T- GA- 



GCCTGGCG TT 
GCCTGGTGTT 
GCCTGGCG CT 
GCCTGGCG CT 
GCCTGGCG CT 
GCATGGCGCT 
GCCTGGCG CT 
GCGTGGCG CT 
GC-TGG-G -T 



TCGACTACGT 
TCGACTACGT 
TCGACTACGT 
TCGACTACGT 
TCGACTACGT 
TCGACTACGT 
TTGACTACGT 
TTGACTACGT 
T-GACTACGT 



CAAGGGCTAC 
CAAGGGCTAC 
CAAGGGCTAT 
CAAGGGCTAT 
CAAGGGCTAC 
CAAGGGCTAC 
CAAGGGCTAC 
TAAGGGCTAC 
-AAGGGCTA- 



700 

GGAGCATGGG 
GGAGCCTGGG 
GCTCCCTGGG 
GCTCCCTGGG 
GCTCCCTGGG 
GGAGCGTGGG 
GGAGCGTGGG 
GGCGCGTGGG 
G---C-TGGG 



SEQ ID NO: 83 
SEQ ID NO: 85 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID Na 79 
SEQ ID Na 81 
CLONEA 

Consen sus 



701 

TTGTTAACGA 
TTGTTAACGA 
TCGTCAGGGA 
TCGTCAAGGA 
TCGTTAAGAA 
TCGTCAAGGA 
TCGTCAAGGA 
TCGTCAAGGA 
T-GT-A A 



CTGGCTCAGC 
CTGGCTCAGC 
CTGGCTGAAC 
CTGGCTGAAC 
CTGGCTGAAC 
CTGGCTGGAC 
CTGGCTGAAC 
CTGGCTGAGT 
CTGGCT 



TGGTGGGGAG 
TGGTGGGGAG 
TGGTGGGGAG 
TGGTGGGGAG 
CGGTGGGGCG 
TGGTGGGGAG 
TGGTGGGGCG 
CAGTGGGGCG 
--GTGGGG-G 



GCTGGGCCGT 
GCTGGGCCGT 
GCTGGGCAGT 
GCTGGGCGGT 
GCTGGGCGGT 
GCTGGGCCGT 
GCTGGGCCGT 
GCTGGGCCGT 
GCTGGGC-GT 



750 

TGGAGAGTAC 
TGGAGAGTAC 
TGGAGAGTAC 
TGGAGAGTAC 
TGGAGAGTAC 
CGGGGAGTAC 
TGGCGAGTAC 
CGGCGAGTAC 
-GG-GAGTAC 
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Figure 14c (cont.) 



SEQIDNO: 89 
SEQIONO: 85 
SEQ ID NO: 7S 
SEQIDNO: 77 
SEQIDNO: 73 

SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consen sus 



751 

TGGGACACGA 
TGGGACACTA 
TGGGACACGA 
TGGGACACGA 
TGGGACACGA 
TGGGACACAA 
TGGGACACGA 
TGGGACACGA 
TGGGACAC-A 



ACGTTGATGC 
ACGTTGATGC 
ACGTCGACGC 
ACGTCGACGC 
ACGTCGATGC 
ACGTTGATGC 
ACGTTGATGC 
ACGTCGATGC 
ACGT-GA- GC 



ACTCCTCAAC 
ACTCCTCAAC 
TGTTCTCAAC 
TGTTCTCAAC 
ACTCCTGAGC 
ACTCCTCAAC 
ACTCCTCAAC 
GCTCCTCAAC 
--T-CT-A-C 



TGGGCATACA 
TGGGCATACA 
TGGGCATACT 
TGGGCATACT 
TGGGCCTACG 
TGGGCCTACT 
TGGGGCTACT 
TGGGGCTACA 
TGGGC-TAC- 



800 

GCAGCGGCGG 
ACAGCGGCGC 
CGAGCGGTGC 
CGAGCGGTGC 
ACAGCGGTGC 
CGAGCGATGC 
CGAGCGGCGC 
GCAGCGGCGG 
--AGCG--GC 



SEQ ID NO: 83 
SEQIDNO: 85 
SEQ ID NO: 75 

SEQIDNO: 77 
SEQIDNO: 73 
SEQ ID NO: 79 
SEQ ID NO: 81 

CLONE A 

Consen sus 



801 

CAAGGTCTTT 
CAAGGTCTTT 
CAAGGTCTTT 
CAAGGTCTTT 
TAAAGTCTTC 
AAAAGTCTTC 
CAAGGTCTTC 
CAAGGTCTTC 
-AA-GTCTT- 



GACTTGCCGC 
GACTTCCCGC 
GACTTCGCCC 
GAGTTCGCCC 
GACTTCCCGC 
GACTTCCCGC 
GACTTCCCGC 
GACTTCCCGC 
GAGTTG-C-C 



TCTACTACAA 
TCTACTACAA 
TCTACTACAA 
TCTACTACAA 
TCTACTACAA 
TCTACTACAA 
TCTACTACAA 
TCTACTACAA 
TCTACTACAA 



GATGGACGAA 
GATGGACGAA 
GATGGACGAG 
GATGGACGAG 
GATGGACGAG 
GATGGACGGG 
GATGGATGAG 
GATGGACGAG 
GATGGA-G-- 



850 

GCCTTCGAGA 
GCCTTCGACA 
GCCTTCGATA 
GCCTTCGATA 
GCCTTCGATA 
GCCTTTGACA 
GCCTTTGACA 
GCCTTTGACA 
GCCTT-GA-A 



SEQ ID Na 83 
SEQ ID NO: 85 
SEQIDNO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID NO: 79 

SEQ ID NO: 8 1 
CLONE A 

Consen sus 



851 

ACACCAAGAT 
ATACCAAGAT 
ACAACAACAT 
ACAACAACAT 
ACAACAACAT 
ACAACAACAT 
ACAAAAACAT 
ACAACAACAT 
A-A--AACAT 



CCCGGGATTA 
CCCGGCTTTG 
TCCCGGCCTG 
TCCCGCCCTG 
CCGCGGCCTC 
TCCCGGACTC 
TCCAGCGCTC 
TCCCGCCCTG 
-CG-GC--T- 



GTGGATGCAC 
GTTTACGCC C 
GTGGAGGCCG 
GTGGAGGCCC 
GTGGAGGCCC 
GTCGAGGCCC 
GTCTCTGCC C 
GTTTACGCC A 
GT GC- - 



TCAGATACGG 
TCAAGAATGG 
TCAGATACGG 
TCAGATACGG 
TCAAGAACGG 
TCAAGAACGG 
TTGAGAACGG 
TCCAGAACGG 
T A-GG 



900 

CCAGACAGTG 
GGGGAGAGTG 
CCAGACAGTG 
TCAGAGAGTG 
AGGCACGGTG 
GGGGACAGTG 
CCAGACTGTT 
TGAAACCGTC 
AC-GT- 



SEQIDNO: 83 
SEQ ID Na 85 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID NO: 79 

SEQ ID NO: 81 
CLONE A 

Consensus 



901 

GTGAGCCGCG 
GTCAGGGGCG 
GTGAGCCGCG 
GTGAGCCGCG 
GTGAGCCGCG 
GTGAGCCGCG 
GTCTCCCGCG 
GTCAGCAGGG 
GTC--C-G-G 



ATGGCTTCAA 
ACCGATTCAA 
AGGGGTTCAA 
ACCGGTTCAA 
AGCGGTTCAA 
ACCCGTTTAA 
ACCGGTTCAA 
ATCGCTTCAA 
A-CG-TT-AA 



GGCGGTAACT 
GGCGGTAACT 
GGCTGTGAGG 
GGCTGTGACG 
AGCCGTGACC 
GGCCGTAACG 
GGCCGTAACC 
GGCCGTTACC 
-GC-GT-AC- 



TTCGTTGCCA 
TTCGTTGCCA 
TTTGTAGCGA 
TTTGTAGGGA 
TTOGTTGCCA 
TTCGTTGCAA 
TTTGTAGCAA 
TTCGTGGCTA 
TT-GT-GC-A 



950 

ACCACGATAC 
ACCACGATAG 
ACCACGATAC 
ACCACGATAC 
ACCACGATAC 
ACCACGACAC 
AGGAGGAGAG 
ACCACGATAC 
ACCACGA-AC 



SEQIDNO: 83 
SEQIDNO: 85 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID Na 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consensus 



951 

AGATATAATC 
AGATATAATG 
CGAGATAATC 
CGACATAATC 
CAACATAATC 
GGACATAATT 
CGATATAATC 
GAACATAATC 
-'A-ATAAT- 



TGGAACAAGT 
TGGAAGAAGT 
TGGAAGAAGT 
TGGAAGAAGT 
TGGAAGAAGT 
TGGAAGAAGT 
TGGAAGAAGT 
TGGAAGAAGT 
TGGAAGAAGT 



ATCCGGCTTA 
ATCCGGGTTA 
ATCCAGCCTA 
ATCCAGCCTA 
ATCCGGGCTA 
ACCCGGGCTA 
ACCTTGCTTA 
ACCCTGGCTA 
A-C--GC-TA 



TGCATTGATC 
TGGATTGATC 
CGCGTTGATC 
CGCGTTGATC 
CGCCTTCATC 
CGCGTTGATC 
TGCTTTGATC 
TGCCTTCATC 
-GC-TTGATC 



1000 
CTTAGCTATG 
CTTACCTATG 
CTCACCTACG 
CTGAGCTACG 
CTCACCTATG 
CTCACCTACG 
CTGAGCTACG 
CTGAGCTACG 
CT-ACCTA-G 
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SEQ ID NO: 83 
SEQIDNO:85 
SEQ ID Na 75 
SEQ ID NO: 77 
SEQ ID NQ: 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consen sus 



1001 

AGGGACAGCC 
AGGGACAGCC 
AGGGCCAGCC 
AGGGCCAGCC 
AGGGACAGCC 
AGGGCCAGCC 
AAGGCCAGCC 
AAGGTCAGCC 
A-GG-CAGCC 



TGTTATAT TC 
TGTTATATTC 
GACAATATTC 
GACAATATTC 
GGCAATATTC 
GACGATATTC 
CGTCATATTT 
CGTCATCTTC 
AT-TT- 



TACCGCGACT 
TACCGCGACT 
TACCGCGACT 
TACCGCGACT 
TACCGCGACT 
TACCGCGACT 
TACCGCGACT 
TACCGCGACT 
TACCGCGACT 



ACGAGGAGTG 
ACGAGGAGTG 
ACGAGGAGTG 
ACGAGGAGTG 
ACGAGGAGTG 
ACGAGGAGTG 
ACGAGGAGTG 
ACGAGGAGTG 
ACGAGGAGTG 



1050 
GCTCAACAAG 
GCTCAACAAG 
GCTCAACAAG 
GCTCAACAAG 
GCTCAACAAG 
GCTCAACAAG 
GCTCAACAAG 
GCTCAACAAG 
GCTCAACAAG 



SEQ ID NO: 83 

SEQ ID NO: 85 

SEQ ID NO: 75 
SEQ ID NO: 77 

SEQ ID Na 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consen sus 



1051 

GATAAGCTTA 
GATAAGCTTA 
GACAAGCTCA 
GATAAGCTCA 
GACAGGCTCA 
GACAGGCTCA 
GACAGGTTGA 
GACAAACTCA 
GA-A---T-A 



ACAACCTCAT 
ACAACCTCAT 
ACAACCTCAT 
ACAACCTCAT 
GGAACCTCAT 
ACAACCTCAT 
ACAACCTCAT 
ACAACCTCAT 
--AACCTCAT 



CTGGATACAC 
CTGGATACAC 
CTGGATACAT 
CTGGATACAT 
CTGGATACAC 
CTGGATACAC 
ATGGATACAC 
ATGGATTCAC 
-TGGAT-CA - 



GATCACCTTG 
GATCACCTTG 
GACAACCTCG 
GACAACCTCG 
GACCACCTCG 
GACCACCTCG 
GACCACCTCG 
GAGCACCTGG 
GA--ACCT-G 



1100 
CTGGAGGGAG 
CTGGAGGGAG 
CCGGAGGGAG 
CCGGAGCGAG 
CGGGAGGAAG 
CCGGTGGAA6 
CAGGTGGAAG 
CAGGGGGAAG 
C-GG-GG-AG 



SEQ ID NO: 83 
SEQ ID NO: 85 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consen sus 



1101 

TACTGACATT 
TACTGACATT 
CACTGACATC 
CACTGACATC 
CACAGACATC 
CACCGACATA 
CACGAGCATA 
CACCAAGATC 
-AC AT- 



GTTTACTACG 
GTTTACTACG 
GTTTACTACG 
GTTTACTACG 
ATCTACTACG 
GTCTACTACG 
GTTTACTACG 
CTCTACTACG 
-T-TACTACG 



ACAGCGACGA 
ACAGCGACGA 
ACAACGACGA 
ACAAC6ACGA 
ACAGCGACGA 
ATAACGATGA 
ACAGCGACGA 
ACGACGATGA 
A---CGA-GA 



GCTTATCTTT 
GCTTATCTTT 
GCTGATATTC 
GCTGATATTC 
GCTTATCTTC 
ACTCATCTTC 
GATGATTTTC 
GCTCATCTTC 
--T-AT-TT- 



1150 
GTGAGAAACG 
GTGAGAAACG 
GTGAGAAACG 
GTGAGAAACG 
OTGAGAAACG 
GTCAGGAACG 
GTGAGGAACG 
ATGAGGGAAG 
-T-AG--A-G 



SEQ ID NO: 83 
SEQ ID NO: 85 
SEQ ID NO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consen sus 



1151 

GCTATGG CAC 
GCTATGG CAC 
GCTACGGAAG 
GCTACGGAAG 
GCTACGGGGA 
GCTACGGGGA 
GCTATGG AAG 
GCTACGGCGA 
GCTA-GG--- 



CAAACCAGGA 
CAAACCAGGA 
CAAGCCGGGA 
CAAGCCGGGA 
CAAGCCGGGA 
CAAGCCGGGG 
CAAGCCTGGC 
CAGGCCCGGG 
CA--CC-GG- 



CTGATAACCT 
CTGATAACCT 
CTGATAACAT 
CTGATAACAT 
CTGATAACCT 
CTTATAACCT 
CTTATAACTT 
CTTATAACCT 
CT-ATAAC- T 



ATATCAACCT 
ATATCAACCT 
ACATCAACCT 
ACATCAACCT 
ACATCAACCT 
ACATCAACCT 
ACATCAACCT 
ACATCAACCT 
A-ATCAACCT 



1200 
CGGCTCAAGC 
CGGCTCAAGC 
CGGCTCAAGC 
CGGCTCAAGC 
CGGCTCAAGC 
AGGCTCGAGC 
CGGCTCGAGC 
CGCTAGCGAC 
-G C 



SEQ ID NO: 83 
SEQ ID NO: 85 
SEQ ID NO: 75 
SEQ ID Na 77 
SEQ ID NO: 73 
SEQ ID NO: 79 

SEQ ID NO: 81 
CLONE A 

Consen sus 



1201 

AAAGTTGGAA 
AAAGCTGGAA 
AAAGCCGGAA 
AAAGCCGGAA 
AAGGCCGGAA 
AAGGCCGGGA 
AAGGTTGGAA 
TGGGCGGAGA 
---G--G--A 



GGTGGGTCTA 
GGTGGGTCTA 
GGTGGGTTTA 
GGTGGGTTTA 
GGTGGGTCTA 
GGTGGGTCTA 
GGTGGGTTTA 
GATGGGTGAA 
G-TGGGT- -A 



CGTT. . .CCA 
CGTT. . .CCA 
CGTT . . . CC G 
CGTT. . .CCG 
CGTT . , . CC G 

CGTT CCG 

TCTG. . .CCG 
CGTTG6CTCA 
-GT C- 



AAGTTCGCCG 
AAGTTCGCCG 
AAGTTCGCAG 
AAGTTCGCAG 
AAGTTCGCAG 
AAGTTCGCGG 
AAGTTCGCCG 
AAGTTCGCCG 
AAGTTCGC-G 



1250 
GTTCATGCAT 
GTTCATGCAT 
GCTCGTGCAT 
GCTCGTGCAT 
GCTCGTGCAT 
GAGCGT6CAT 
GCGCGTCCAT 
GCTATACAAT 
G AT 
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Figure 14c (cont.) 



SEQIDNO: 83 
SEQ ID NO: 8$ 
SEQ ID NO: 75 
SEQIDNO: 77 
SEQ ID Na 73 
SEQ ID NO: 79 
SEQ ID Nasi 
CLONE A 

Consen sus 



1251 

CCACX3AGTAC 
CCACGAGTAC 
ACACX3AGTAC 
ACACGAGTAC 
ACACGAGTAC 
CCACGAGTAC 
CCACGAGTAT 
CCACGAATAC 
-CACGA- TA- 



ACCGGCAACC 
ACCGGCAGCC 
ACCGGCAACC 
ACCGGCAATC 
ACCGGCAACC 
ACCGGCAACC 
ACTGGTAACC 
ACCGGAAACC 
AC-GG-A- -C 



TOGGCGGTTG 
TCGGCGGTTG 
TCGGCGGCTG 
TCGGCGGCTG 
TCGGCGGCTG 
TCGGCGGCTG 
TCGGAGGCTG 
TCGGCGGCTG 
TCGG-GG-TG 



GATAGACAAG 
GATAGACAAG 
GGTGGACAAG 
GGTGGACAAG 
GATTGACAAG 
GGTGGACAAG 
GGTAGACAAG 
GGTCGACAGG 
G-T-GACA-G 



1300 
TACGTCTCCT 
TACGTCTCCT 
TGGGTGGACT 
TGGGTGGACT 
TGGGTTGACT 
TGGGTGGACT 
TAOGTCTACT 
TACGTCCAGT 
T--GT T 



SEQ ID NO: 83 
SEQ ID NO: 85 
SEQIDNO: 75 
SEQ ID NO: 77 
SEQ ID NO: 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consensus 



1301 

CCAGCGGCTG 
CCAGCGGCTG 
CAAGCGGCTG 
CAAGCGGCTG 
CAAGCGGTCG 
CAAGCGGCTG 
CAAGCGGCTG 
ACGACGGCTG 
CGG--G 



GGTCTATC TT 

GGTCTACCTT 
GGTTTACCTC 
GGTCTACCTC 
GGTCTACCTT. 
GGTGTACCTC 
GGTCTATTTC 
GGTCAAGCTT 
GGT--A--T- 



GAGGCCCCAG 
GAGGCCCCGG 
GAGGCTCCTG 
GAGGCTCCTG 
GAGGCCCCCG 
GAGGCCCCTG 
GAAGCTCCAG 
ACCGCTCCGC 
---GC-CC-- 



CCCACGACCC 
CCCACGACCC 
CCCACGACCC 
CCCACGACCC 
CCCACGACCC 
CCCACGACCC 
CTTACGACCC 
CACACGATCC 
C--ACGA-CC 



1350 
GGCGAACGGC 
GGCCAATGGC 
GGCCAACGGC 
GGCCAACGGC 
GGCCAACGGC 
GGCCAACGGC 
TGCCAACGGG 
GGCAAACGGC 
-GC-AA-GG- 



SEQIDNO:83 
SEQIDNO: 85 
SEQIDNO: 75 
SEQIDNO: 77 
SEQIDNO: 73 
SEQ ID NO: 79 
SEQ ID NO: 81 
CLONE A 

Consensus 



1351 

TACTACGGCT 

CAGTATGGCT 
CAGTACGGCT 
CAGTACGGCT 
CAGTACGGCT 
TATTACGGCT 
CAGTATGGCT 
TATTACGGCT 
-A-TA-GGCT 



ACTCCGTATG 
ACTCCGTCTG 
ACTCCGTTTG 
ACTCCGTCTG 
ACTCCGTATG 
ACTCCGTCTG 
ACTCCGTGTG 
ACTCCGTCTG 
ACTC-GT- TG 



GAGCTACTGC 
GAGCTATTGC 
GAGCTATTGC 
GAGCTACTGC 
GAGCTACTGC 
GAGCTACTGC 
GAGCTATTG C 
GAGCTACGCC 
GAGCTA---C 



1393 

GGGGTTGGGT GA- 

GGGGTTGGGT GA- 

GGTGTTGGGT GA- 

GGTGTTGGGT GA- 

GGTGTTGGGT GA- 

GGGGTGGGCT GA- 

GGTGTTGGGT GA- 

GGAGTTGGAT GA- 

GG-GT-GG-T GA- 
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Neighbor-joining tree for Thermococcales 
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Figure 16 (all sequences are listed in 5' to 3' order) 
SEQIDNO.:! 

atggcaaagtattccgagctcgaagagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacgat 

agcccagaagatacccgactgggcaagcgccgggatttcggcgatatgg^ttcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacaccgcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaacccctt 

cgtgaacgactatacctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgc 

gggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctacggagcgtgggtcgtcaaggactggctggactg 

gtggggaggctgggccgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgactt 

cccgctctactacaagatggacgcggcctttgacaacaagaacattcccgcactcgtcgaggccctcaagaacgggggcacagtcgtcagcc 

gcgacccgtttaaggccgtaaccttcgttgcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgag 

ggccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccgg 

aggaagcactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaa 

cctcgcctcaagcaaagccggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggt 

ggacaagtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagc 

tactgcggtgttgggtga 

SEQ ID NO.: 2 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Tip Asp Val Pro Met 
Gly Gly He Trp Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val 
Val Lys Asp Trp Leu Asp Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Ala 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn Gly Gly Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Trp lie His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Ala Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 3 

atggccaagtacctggagctcgaagagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacgat 

agcccagaagatacccgactgggcaagcgccgggatttcggcgatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacaccgcccacgcctacggcatcaaggtcatcgcagacatagtaatcaaccaccgcgccggaggagaccttgagtggaacccct 

tcgtcaatgactacacctggacggacttctcgaaggtcgcttccggcaagtacacggccaattacctcgacttccacccgaacgagctccatgc 

gggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggt 

ggggaggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcg 

ccctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcgac 

ccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggcc 



wo 02/068589 
Figure 16 (cont.) 



27/116 



PCT/US02/05068 



agccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaaccto 

agcactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacctc 

gcctcaagcgaagccggaaggtgggtctacgttccgaagttcgcgggagcgtgcatccacgagtacaccggcaacctcggcggctgggtgg 

acaagtgggtggactcaagcgggtgggtgtacctcgaggcccctgcccacgacccggccaacggctattacggctactc^^ 

gcggtgttgggtga 

SEQIDN0.:4 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met 
Gly Gly He Trp Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Tip Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val 
Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Tip Asp Thr Asn Val Asp Ala 
Val Leu Asn Tip Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp lie He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp lie Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Ala Ser Ser Glu Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly ... • 

SEQ ID NO.: 5 

atggccaagtactccgagctggaagagggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccat 

caggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcccccggcaagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacaccgcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaacccctt 

cgtgaacgactatacctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgc 

gggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggt 

ggggaggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcg 

ccctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcgac 

ccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtaccttgcttatgctttcatcctcacctacgaaggccag 

cccgtcatattctaccgcgaccacgaggagtggctcaacaaggacaggttgaacaacctcatatggatacacgaccacctcgcaggtggaag 

caccgacatagtctactacgataacgatgaactcatcttcgtcaggaacggctacggggacaagccggggcttataacctacatcaacctaggc 

tcgagcaaggccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctgggtagacaa 

gtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccgtgtgga 

gtgggctga 

SEQ ID NO.: 6 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Trp Trp Asp Thr He Arg Ser Lys He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly Met Lys Val He Ala Asp lie Val He Asn His Arg Ala Gly Gly 
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Asp Leu GIu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Tip Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Tip Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val 
Val Lys Asp Trp Leu Asn Trp Tip Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Val Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Leu Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp His Glu Glu Trp Leu Asn 
Lys Asp Arg Leu Asn Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Gly Ser Ser Lys Ala Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDN0.:9 

atggccaagtactccgagctggaagagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacgat 

agcccagaagatacccgactgggcaagcgccgggatttcggcgatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgcmggctccaagcaggagctcg^g 

catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 

ttcgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatg 

cgggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggt 

ggggaggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcg 

ccctctactacaagatggacgaggccttcgataacaacaacattcccgccctggtggacgccctcagatacggtcagacagtggtcagccgcg 

acccgttcaaggctgtgacgjttgtagccaaccacgataccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggc 

cagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggagg 

aagcactgacatcgtttactacgacaacgacgagctgatattcgcgagaaacggctacggaagcaagccgggactgataacatacatcaacct 

cgcctcaagcaaagccggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtgg 

acaagtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagctac 
tgcggtgttgggtga 

SEQIDNO.: 10 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met 
Gly Gly He Trp Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Tip Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val 
Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Val Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Asn Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Tip Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Tip Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Tip He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Ala Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Ala Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu 
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Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tip Val Asp Ser Ser Gly Tip Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.ill 

atggccaagtacctggagctcgaggagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacgat 

agcccagaagatacccgactgggcaagcgccgggatttcggcgatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacaccgcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaacccctt 

cgtgaacgactatacctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgc 

gggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccaigtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggt 

ggggaggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcg 

ccctctactacaagatggacgaggccttcgataacaacaacattcccgccctggtggacgccctcagatacggtcagacagtggtcagccgcg 

acccgncaaggctgtgacgtttgtagccaaccacgataccgatataatctggaacaagtatccagcctacgcgttcatcctcacctac 

cagccgacaatattctaccgcgactacgaggagtggctcaacaaggatacgctcaagaacctcatctggatacatgacaacctcgccggagg 

aagcacgagcatagtttactacgacagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaacctc 

ggctcgagcaaggttggaaggtgggtctacgttccgaagttcgcgggagcgtgcatccacgagtacaccggcaacctcggcggctgggtgg 

acaagtgggtggactcaagcgggtgggtgtacctcgaggcccctgcccacgacccggccaacggctattacggctactccgtctggagctac 

^gcggtgttggctga 

SEQroN0.:12 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met 
Gly Gly He Trp Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val 
Val Lys Asp Trp Leu Asn Trp Tip Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Val Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Asn Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe lie Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyt Arg. Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Thr Leu Lys Asn Leu He Tip He His Asp Asn Leu Ala Gly Gly Ser Thr Ser He Val Tyr 
Tyr Asp Ser Asp Glu Met lie Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu lie Thr Tyr He 
Asn Leu Gly Ser Ser Lys Val Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Tip Val Asp Lys Tip Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQIDNO.: 13 

atggccaagtacctggagctcgaagagggcggggtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaat 

acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtatgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaac 

atgataaacacggcacatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccgtt 

cgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccaccccaacgaggtcaagtg 

ctgtgacgagggcacatttggaggcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagctac 

gccgcctacctaaggagcatcggcgttgatgcctggcgcttcgactacgtcaagggctacggagcgtgggtcgtcaaggactggctggactg 

gtggggaggctgggccgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgactt 
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cccgctctactacaagatggatgaggccmgacaacaaaaacanccagcgctcgtctctgcccncagaacggccagactgttgtctcccgc^ 

acccgncaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgaggg 

ccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggag 

gaagcactgac^tagtctactacgataacgatgaactcatcncgtcaggaacggctacggggacaagccggggcttataacctaca^^ 

aggctcgagcaaggccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctgggtag 

acaagtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccgtgtggagctactg 

cggtgttggctga 

SEQIDNO.: 14 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val lie Met Gin Ala Phe Tyr Trp Asp Val Pro Ser 
Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Tip Tyr Asp Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly 
Gly Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin His Trp Leu Trp Ala Ser Asp Glu Ser Tyr 
Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp 
Val Val Lys Asp Trp Leu Asp Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp 
Ala Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp 
Glu Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser 
Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro 
Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Tip Leu 
Asn Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val 
Tyr Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr 
He Asn Leu Gly Ser Ser Lys Ala Gly Arg Ttp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His 
Glu Tyr Thr Gly Asn Leu Gly Gly Tip Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu 
Ala Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.: 15 

atggccaagtactccgagctggaagagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacgat 

agcccagaagatacccgactgggcaagcgccgggatttcggcgatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 

ttcgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatg 

cgggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctacggagcgtgggtcgtcaaggactggctggactg 

gtgggg^ggctgggccgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgactt 

cccgctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtc^^ 

acccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataatttggaacaagtacccggcctacgccttcatcctcacctacgaggg 

ccagccgacgatattctaccgcgactacgaggagtggctcaacaaggacaggctcaagaacctcatctggatacacgaccaccttgccggtg 

gaagcactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacc 

tcgcctcaagcaaagccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctgggtag 

acaagtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccgtgtggagctattgc 

ggtgttgggtga 

SEQHDNO.: 16 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met 
Gly Gly He Tip Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Tip 
lie Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
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Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin GIu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Tip Val 
Val Lys Asp Trp Leu Asp Trp Trp Gly Gly Tq> Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Arg Leu Lys Asn Leu He Tip He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Ala Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDN0.:17 

atggccaagtactccgagctggaagggggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacgat 

agcccagaagatacccgactgggcaagcgccgggatttcggcgatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccaggagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 

ttcgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccaccccaacgaggtcaagt 

gctgtgacgagggcacatttggaggcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagcta 

cgccgcctacctaaggagcatcggcgttgatgcctggcgcttcgactacgtcaagggctacggagcgtgggtcgtcaaggactggctggact 

ggtggggaggctgggccgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgac 

ttcccgctctactacaagatggacgcggcctttgacaacaagaacattcccgcactcgtcgaggccctcaagaacgggggcacagtcgtcagc 

cgcgacccgtttaaggccgtaaccttcgttgcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacga 

gggccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccg 

gaggaagcacgagcatagtttactacgacagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaa 

cctcggctcgagcaaggttggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggt 

ggacaagtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagc 

tactgcggtgttgggtga 

SEQIDNO.: 18 

Met Ala Lys Tyr Ser Glu Leu Glu Gly Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met 
Gly Gly He Trp Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Glu Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly 
Gly Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin His Trp Leu Trp Ala Ser Asp Glu Ser Tyr 
Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Tip Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp 
Val Val Lys Asp Trp Leu Asp Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp 
Ala Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp 
Ala Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn Gly Gly Thr Val Val Ser 
Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro 
Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu 
Asn Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Ser He Val 
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Tyr Tyr Asp Ser Asp Glu Met He Phe Val Arg Asn Gly Tyr Gly Set Lys Pro Gly Leu He Thr Tyr 
He Asn Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His 
Glu Tyr Thr Gly Asn Leu Gly Gly Tip Val Asp Lys Trp Val Asp Ser Ser Gly Tip Val Tyr Leu Glu 
Ala Pro Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQIDNO.: 19 

atggccaagtacctggagctcgaagagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctg^^ 

agcccagaagatacccgactgggcaagcgccgggatttcggcgatatggattcctcccgcgagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgattattttgaccttggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacat 

gataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccgttc 

gttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatgcg 

ggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgc 

ggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggct^ 

gggggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtcmgacttcgcc 

ctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcgaccc 

gttcaaggccgtaacctttgtagcaaaccacgacaccgatataatUggaacaagtacccggcctacgccttcatcctcacctacga^ 

ccgacgatattctaccgcgactacgaggagtggctcaacaaggacaggctcaagaacctcatctggatacacgaccacctcgccggtggaag 

cactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacctcgc 

ctcaagcaaagccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagcatactggtaacctcggaggctgggtaga 

agtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccg^g^^ 

tgttggctga 

SEQIDNO.: 20 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met 
Gly Gly He Trp Trp Asp Thr He Ala Gin Lys He Pro Asp Tip Ala Ser Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Tip Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly lie Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Tip Val Val 
Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Val 
Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu Ala 
Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg Asp 
Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Tip Asn Lys Tyr Pro Ala Tyr 
Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys 
Asp Arg Leu Lys Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr 
Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He Asn 
Leu Ala Ser Ser Lys Ala Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu His 
Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro 
Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 21 

atggccaagtactccgagctggaagagggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccat 

caggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcctcccgggagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgatgatttggacctgggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaac 

atgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccgtt 

cgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatgc 

gggcgaltccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 
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gcggtatatcteaggagcatcggcatcgatgcctggcgcUcgactacgtcaagggctacggagcgtgggtcgtcaa 

gtggggaggctgggccgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgactt 

cccgctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcg 

acccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataamggaacaagtacccggcctacgccttcatcc^^ 

ccagccgacgatattctaccgcgactacgaggagtggctcaacaaggacaggctcaagaacctcatctggatacacgactacctcgccggtg 

gaagcactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacc 

tcgcctcaagcaaagccggaaggtgggmatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggc 

acaagtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccg^^ ^ 

ggtgttggctga 

SEQID NO.: 22 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Trp Trp Asp Thr He Arg Ser Lys He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala He Tip 
He Pro Pro Gly Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Asp Leu Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gbi Glu Ser Tyr Ala Val 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val 
Lys Asp Trp Leu Asp Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Arg Leu Lys Asn Leu He Trp He His Asp Tyr Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Ala Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQID NO.: 23 

atggccaagtactccgagctggaagagggcggcgttatagtgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccat 

caggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 

ttcgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatg 

cgggcg^(^<^cgg^3C3tttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctacggagcgtgggtcgtcaaggactggctggact^ 

gtggggaggctgggccgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgactt 

cccgctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcg 

acccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgaggg 

ccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggag 

gaagcatgagcatagtttactacgacagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaacctc 

ggctcgagcaaggttggaaggtgggtctacgttccgaagttcgcgggagcgtgcatccacgagtacaccggcaacctcggcggctgggtgg 

acaagtgggtggactcaagcgggtgggtgtacctcgaggcccctgcccacgacccggccaacggctattacggctactccgtctggagctatt 

gcggtgttggctga 



SEQID NO.: 24 
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Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val lie Val Glii Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Tip Trp Asp Thr He Arg Ser Lys He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val lie Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Tip Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val 
Val Lys Asp Trp Leu Asp Tip Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe. Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Met Ser He Val Tyr 
Tyr Asp Ser Asp Glu Met He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.:25 

atggccaagtacctggagctcgaagagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacgat 

agcccagaagatacccgactgggcaagcgccgggatttcggcgatatggattcctcccgcgagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagcteataaaca^ 

gataaacaccgcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaaccccttcgt 

gaacgactatacctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgcggg 

cgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactg^tctgggccagccaggagagctacgcgg 

catatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggtgggg 

aggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcgccctc 

tactacaagatggacgaggccttcgataacaacaacattcccgccctggtgggcgccctcagatacggtcagacagtggtcagccgcgaccc 

gttcaaggctgtgacgtttgtagccaaccacgataccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggccagc 

cgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggaggaagc 

accgacatagtctactacgataacgatgaactcatcttcgtcaggcacggctacggggacaagccggggcttataacctacatcaacctaggct 

cgagcaaggccggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtggacaa 

gtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagctattgcg 

gtgttgggtga 

SEQ ID NO.: 26 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met 
Gly Gly He Tip Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly Met Lys Val lie Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val Val 
Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Val 
Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu Ala 
Phe Asp Asn Asn Asn He Pro Ala Leu Val Gly Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg Asp 
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Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp lie He Trp Asn Lys Tyr Pro Ala Tyr 
Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys 
Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr 
Asp Asn Asp Glu Leu lie Phe Val Arg His Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr He Asn 
Leu Gly Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr 
Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro 
Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQroNO.: 27 

atggcaaagtattccgagctcgaagagggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccatc 

aggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcctcccgcgagcaagggtatgagcggcggctattcgatggg 

ctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacatg 

ataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccgttcgt 

tggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatgcggg 

cgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgcgg 

catatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggtgggg 

aggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacUcgccctc 

tactacaagatggacgcggcctttgacaacaagaacattcccgcactcgtcgaggcccteaagaacgggggcacagtcgtcagccgcgacc 

cgtttaaggccgtaaccttcgttgcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggccag 

ccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggaggaag 

cactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacctcgc 

gtcaagcaaagccggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtggaca 

agtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagctactgc 

ggtgttgggtga 

SEQ ID NO.: 28 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gbi Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Trp Trp Asp Thr He Arg Ser Lys He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val Val 
Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Val 
Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Ala Ala 
Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn Gly Gly Thr Val Val Ser Arg Asp 
Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala Tyr 
Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys 
Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp lie Val Tyr Tyr 
Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He Asn 
Leu Ala Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr 
Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro 
Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 29 

atggccaagtacctggagctcgaagagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacggt 
agcccagaagatacccgactgggcaagcgccgggatttcggcgatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 
ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 
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catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggaga^ 

ttcgttggggactacacctggacggacttctcaaaggtggtctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatgc 

gggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactg^ 

ggggaggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcg 

ccctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcgac 

ccgttcaaggccgtaaccmgtagcaaaccacgacaccgatataatctggaacaagtaccttgcttatgccttcat^^ 

cccgtcatattctaccgcgactacgaggagtggctcaacaaggacaggttgaacaacctcatatggatacacgaccacctcgcagggggaag 

caccgacatagtctactacgataacgatgaactcatcttcgtcaggaacggctacggggacaagccggggcttataacctacatcaacctaggc 

tcgagcaaggccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcgg^^ 

gtacgtctactcaagcggctgggtctatctegaagctccagcttacgaccctgccaacgggcagtatggctactcc^^ 

gttgggtga 

SEQIDNO.: 30 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Tip Asp Val Pro Met 
Gly Gly He Trp Trp Asp Thr Val Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly lie Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Val Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val 
Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Val Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Leu Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Arg Leu Asn Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu lie Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Gly Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQIDNO.: 31 

atggcaaagtactccgagctggaagagggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccat 
caggagcaggataccggagtggtacgaggcgggaatatccgccatttggattcccccggcgagcaagggcatgggcggcgcctattcgatg 
ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 
catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 
ttcgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatg 
cgggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 



tggggcggctgggccgttggcgagtactgggacaccaacgttgatgcactcctcaactgggcctactcgagcggcgccaaggtcttcgacttc 

ccgctctactacaagatggacgaggccttcgataacaacaacattcccgccctggtggacgccctcagatacggtcagacagtggtcagccgc 

gacccgttcaaggctgtgacgtttgtagccaaccacgataccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgaggg 

ccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctggccggag 

gaagcacgagcatagtttactacgacagcgacgagatgatcttcgtgaggaccggctatggaagcaagcctggccttataacttacatcaacct 

cggctcgagcaaggttggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctgggtaga 

caagtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccgtgtggagctattgcg 
gtgttggctga 
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SEQIDNO.:32 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Trp Trp Asp Thr He Arg Ser Arg He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val 
Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Asn Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Ser He Val Tyr 
Tyr Asp Ser Asp Glu Met He Phe Val Arg Thr Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.: 33 

atggccaagtactccgagctggaagagggcggggtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaat 

acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcctcccgcgagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacat 

gataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccgttc 

gttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatgcg 

ggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgc 

ggcatatctcaggagcatcggcatcgatgcctggcgctttgactacgtgaagggctacggagcgtgggtcgtcaaggactggctcaactggtg 

gggcggctgggccgttggcgagtactgggacaccaacgttgatgcactcctcaactgggcctactcgagcggcgccaaggtcttcgactttcc 

gctctactacaagatggacgcggcctttgacaacaagaacattcccgcactcgtcgaggccctcaagaacgggggcacagtcgtcagccgcg 

acccgtttaaggccgtaaccttcgttgcaaaccacgacaccgatataatctggaccaagtaccttgcttatgctttcatcctcacctacgaaggcca 

gcccgtcatattctaccgcgactacgaggagtggctcaacaaggacaggttgaacaacctcatatggatacacgaccacctcgcaggtggaag 

caccgacatagtctactacgataacgatgaactcatcttcgtcaggaacggctacggggacaagccggggcttataacctacatcaacctaggc 

tcgagcaaggccggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtggacaa 

gtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagctac^^^ 

gtgttggctga 

SEQIDNO.: 34 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Ser 
Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val 
Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
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Leu Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Ala 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn Gly Gly Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Tip Thr Lys Tyr Leu Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Arg Leu Asn Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Gly Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.:35 

atggccaagtactccgagctggaagagggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccat 

caggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacaccgcccacgcctacggcatcaaggtcatcgcagacatagtaatcaaccaccgcgccggaggagaccttgagtggaacccct 

tcgtcaatgactacacctggacggacttctcgaaggtcgcttccggcaagtacacggccaactacctcgacttccaccccaacgaggtcaagtg 

ctgtgacgagggcacatttggaggcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagctac 

gccgcctacctaaggagcatcggcgttgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggt 

ggggaggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcg 

ccctctactacaagatggacgcggcctttgacaacaagaacattcccgcactcgtcgaggccctcaagaacgggggcacagtcgtcagccgc 

gacccgtttaaggccgtaaccUcgttgcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcc^^ 

ccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacgtcgccggag 

gaagcaccgacatagtctactacgataacgatgaactcatcttcgtcaggaacggctacggggacaagccggggcttataacctacatcaacct 

aggctcgagcaaggccggaaggtgggtttacgUccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtgg 

acaagtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctg^^ 

tgcggtgttgggtga 

SEQ ID NO.: 36 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Trp Trp Asp Thr He Arg Ser Lys He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly 
Gly Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin His Trp Leu Trp Ala Ser Asp Glu Ser Tyr 
Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp 
Val Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp 
Ala Val Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp 
Ala Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn Gly Gly Thr Val Val Ser 
Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro 
Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu 
Asn Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Val Ala Gly Gly Ser Thr Asp He Val 
Tyr Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr 
lie Asn Leu Gly Ser Ser Lys Ala Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His 
Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu 
Ala Pro Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 71 

atggccaagtacctggagctcgaagagggcggggtcataatgcaggcgttctactgggacgtgcctlcaggaggaatatggtgggacacaat 
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acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcccccggcgagcaagggcatgggcggcgcctattcp^^ 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 

ttcgUggggactacacctggacggacUctcaaaggtagcctcgggcaaatatactgccaactacctcgacttccacccgaacgagcto 

cgggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaact^ 

ggggaggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtcmga^^^ 

ccctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcgac 

ccgtlcaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggcc 

agccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggagga 

agcactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacctc 

gcctcaagcaaagccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctgggtaga 

caagtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctacte^ 

ggggtgggctga 

SEQroNO.:72 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val lie Met Gin Ala Phe Tyr Trp Asp Val Pro Ser 
Gly Gly lie Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Tip Tyr Asp Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Ghi Glu Leu Val Asn 
Met lie Asn Thr Ala His Ala Tyr Gly lie Lys Val He Ala Asp He Val lie Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val 
Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Val Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe lie Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Ala Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.:49 

gtggtttatgacgatgtccgctatgacctttatgccgtaggcatgggccgtgtttatcatgttcacgagctcctgcttggagccaaagcgcgtctct 

accgttcccttctggtcgtactcaccgaggtcaaagaagtcgtaggggtcgtagcccatcgaataggcgccgcccatgcccttgctcgccggg 

ggaatccatatcgccgaaatcccggcgcttgcccagtcgggtatcttctgggctatcgtgtcccaccagattcctcccatggggacgtcccagta 

gaaggcctgcattatgagcccgccctcttcgagcccggaatactttgccataagttacctcctactagtagattaaaattctgtttcctgtgtgaaatt 

gtt 

SEQIDNO.iSO 

Val Val Tyr Asp Asp Val Arg Tyr Asp Leu Tyr Ala Val Gly Met Gly Arg Val Tyr His Val His Glu 
Leu Leu Leu Gly Ala Lys Ala Arg Leu Tyr Arg Ser Leu Leu Val Val Leu Thr Glu Val Lys Glu Val 
Val Gly Val Val Ala His Arg He Gly Ala Ala His Ala Leu Ala Arg Arg Gly Asn Pro Tyr Arg Arg 
Asn Pro Gly Ala Cys Pro Val Gly Tyr Leu Leu Gly Tyr Arg Val Pro Pro Asp Ser Ser His Gly Asp 
Val Pro Val Glu Gly Leu His Tyr Glu Pro Ala Leu Phe Glu Pro Gly He Leu Cys His Lys Leu Pro 
Pro Thr Ser Arg Leu Lys Phe Cys Phe Leu Cys Glu lie Val 
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SEQIDN0.:51 

ATGGCCAAGTACCTGGAGCTCGAAGAGGGCGGGGTCATAATGCAGGCGITCTACTGGG 

ACGTGCCTTCAGGAGGAATATGGTGGGACACAATACGGCAGAAGATACCGGAGTGGT 

ACGATGCCGGAATCTCCGCAATATGGATTCCCCCGGCGAGCAAGGGCATGGGCGGCGC 

CTATTCGATGGGCTACGACCCCTACGACTTCTTTGACCTCGGTGAGTACGACCAGAAG 

GGAACGGTAGAGACGCGCTTTGGCTCCAAGCAGGAGCTCGTGAACATGATAAACACC 

GCCCACGCCTATGGCATGAAGGTAATAGCCGATATAGTCATCAACCACCGCGCCGGCG 

GTGACCTGGAGTGGAACCCCTTCGTGAACGACTATACCTGGACCGACTTCTCAAAGGT 

CGCGTCGGGTAAATACACGGCCAACTACCTCGACTTCCACCCCAACGAGGTCAAGTGC 

TGTGACGAGGGCACATTTGGAGGCTTCCCAGACATAGCCCACGAGAAGAGCTGGGAC 

CAGCACTGGCTCTGGGCGAGCGATGAGAGCTACGCCGCCTACCTAAGGAGCATCGGCG 

TTGATGCCTGGCGCTTTGACTACGTGAAGGGCTACGGAGCGTGGGTCGTCAAGGACTG 

GCTCAACTGGTGGGGCGGCTGGGCCGTTGGCGAGTACTGGGACACCAACGTTGATGCA 

CTCCTCAACTGGGCCTACTCGAGCGGCGCCAAGGTCTTCGACTTCCCGCTCTACTACAA 

GATGGATGAGGCCTTTGACAACAAAAACATTCCAGCGCTCGTCTCTGCCCTTCAGAAC 

GGCCAGACTGTTGTCTCCCGCGACCCGTTCAAGGCCGTAACCTTTGTAGCAAACCACG 

ACACCGATATAATCTGGAACAAGTATCCAGCCTACGCGTTCATCCTCACCTACGAGGG 

CCAGCCGACAATATTCTACCGCGACTACGAGGAGTGGGTCAACAAGGATAAGCTCAAG 

AACCTCATCTGGATACATGACAACCTCGCCGGAGGAAGCACTGACATCGTTTACTACG 

ACAACGACGAGCTGATATTCGTGAGAAACGGCTACGGAAGCAAGCCGGGACTGATAA 

CATACATCAACCTCGCCTCAAGCAAAGCCGGAAGGTGGGTTTACGTTCCGAAGTTCGC 

AGGCTCGTGCATACACGAGTACACCGGCAATCTCGGCGGCTGGGTGGACAAGTGGGTG 

GACTCAAGCGGCTGGGTCTACCTCGAGGCTCCTGCCCACGACCCGGCCAACGGCCAGT 

ACGGCTACTCCGTCTGGAGCTATTGCGGTGTTGGCTGA 

SEQIDNO.: 52 

MAKYLELEEGGVIMQAFWDVPSGGIWWDTIRQKIPEWYDAGISAIWIPPASKGMGGAYS 

MGYDPYDFFDLGEYDQKGTVETRFGSKQELVNMINTAHAYGMKVIADIVINHRAGGDLE 

WNPFVNDYTWTDFSKVASGKYTANYLDFHPNEVKCCDEGTFGGFPDIAHEKSWDQHWL 

WASDESYAAYLRSIGVDAWRFDYVKGYGAWWKDWLNWWGGWAVGEYWDTNVDAL 

LNWAYSSGAKWDFPLYYKMDEAFDNKNIPALVSALQNGQTVVSRDPFKAVTFVANHDT 

DIIWNKYPAYAFILTYEGQPTIFYRDYEEWlJ^KDia.Km.IWIHDNLAGGSTDIVYYDNDEL 

FVW^GYGSKPGLITYINLASSKAGRWVYWKFAGSCIHEYTGNLGGWVDKWVDSSGWVY 

LEAPAHDPANGQYGYSVWSYCGVG 

SEQ ID NO.: 37 

atggccaagtacctggagctcgaagagggcggggtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaat 

acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacaccgcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaacccctt 

cgtgaacgactalacctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgc 

gggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctac 

gcggcatatctcaggagcatcggcatcgatgcctggcgctttgactacgtgaagggctacggagcgcgggtcgtcaaggactggctcaactg 

gtggggcggctgggccgRggcgagtactgggacaccaacgttgatgcactcctoaactgggcctactegagcggcgccaaggtcttcgactt 

cccgctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcg 

acccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgaggg 

ccagccgacaatattctatcgcgactacgaggagtggcteaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggagg 

aagcactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacct 

cgccteaagcaaagccggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtgg 



I 
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acaagtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtac^^ 
tgcggggtggggtga 

SEQ ID NO.: 38 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Ser 
Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly 
Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Arg Val 
Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr lie 
Asn Leu Ala Ser Ser Lys Ala Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQ H) NO.: 39 

atggccaagtacctggagctcgaagagggcggggtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaat 

acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcctcccgcgagcaggggtatgagcggcggct^^ 

gctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacat 

gataaacaccgcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaaccccttcgt 

gaacgactatacctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgcggg 

cgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgcgg 

catatctcaggagcatcggtatcgatgcctggcgctttgactacgtgaagggctacggagcgtgggtcgtcaaggactggctcaactggtggg 

gcggctgggccgttggcgagtactgggaccccaacgttgatgccctcctcccctgggcctactcgagcggcgccaaggtcttcgacttcc 

tctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcgacccg 

ttcaaggccgtaacctttgtagccaaccacgataccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggccagcc 

gacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggaggaagca 

ccgacatagtctactacgataacgatgaactcatcttcgtcaggaacggctacggggacaagccggggcttataacctacatcaacctaggctc 

gagcaaggccggaaggtgggtctacgttccgaagttcgcgggagcgtgcatccacgagtacaccggcaacctcggcggctgggtggacaa 

gtgggtggactcaagcgggtgggtgtacctcgaggcccctgcccacgacccggccaacggctattacggctactcc^^^ 

gggtgggctga 

SEQ ID NO.: 40 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Ser 
Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Arg Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Tip Ala Ser Gin Glu Ser Tyr Ala Ala 
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Tyr Leu Arg Ser He Gly lie Asp Ala Tip Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val 
Lys Asp Trp Leu Asn Trp Tip Gly Gly Trp Ala Val Gly Glu Tyr Tip Asp Pro Asn Val Asp Ala Leu 
Leu Pro Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu Ala 
Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg Asp 
Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala Tyr 
Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys 
Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr 
Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr He Asn 
Leu Gly Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu Tyr 
Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tip Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro 
Ala His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 41 

atggccaagtacctggagctcgaagagggcggggtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaat 

acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcctcccgcgagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacat 

gataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccgttc 

gttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccacccgaacgagctccatgcg 

ggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgc 

ggcatatctcaggagcatcggcatcgatgcctggcgcmgactacgtgaagggctacggagcgtgggtcgtcaaggactggctcaactggt 

gggcggctgggccgttggcgagtactgggacaccaacgttgatgcactcctcaactgggcctactcgagcggcgccaaggtcttcgacttccc 

gctctactacaagatggacgcggcctttgacaacaagaacattcccgcactcgtcgaggccctcaagaacgggggcacagtcgtcagccgcg 

acccgtttaaggccgtaaccttcgttgcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggc 

cagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggagg 

aagcacgagcatagtttactacgacagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaacctc 

ggctcgagcaaggnggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctgggtaga 

aagtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccgtgtggagc 

gtgttgggtga 

SEQ ro NO.: 42 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Ser 
Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Tip Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val 
Lys Asp Trp Leu Asn Trp Trp Gly Gly Tip Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Ala 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn Gly Gly Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe lie Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Tip Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Ser He Val Tyr 
Tyr Asp Ser Asp Glu Met He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 
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SEQIDNO.:43 

atggccaagtactccgagctggaagagggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccat 

caggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacncmgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 

ttcgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccaccccaacgaggtcaagt 

gctgtgacgagggcacatttggaggcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagcta 

cgccgcctacctaaggagcatcggcgttgatgcctggcgcttcgactacgtcaagggctacggagcgtgggtcgtcaaggactggctggact 

ggtggggaggctgggccgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgac 

ttcccgctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgc 

gacccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtatocagcctacgcgttcatcctcacctacgagg 

gccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgtcggag 

gaagcacgagcatagtttactacgacagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaacct 

cggctcgagcaaggttggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtgg 

acaagtgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagctac 

tgcggtgttggctga 

SEQIDNO.:44 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Trp Trp Asp Thr He Arg Ser Lys He Pro Glu Tip Tyr Glu Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly 
Gly Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin His Tip Leu Tip Ala Ser Asp Glu Ser Tyr 
Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Tip Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Tip 
Val Val Lys Asp Trp Leu Asp Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Tip Asp Thr Asn Val Asp 
Ala Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp 
Glu Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser 
Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro 
Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu 
Asn Lys Asp Lys Leu Lys Asn Leu He Tip He His Asp Asn Leu Val Gly Gly Ser Thr Ser He Val 
Tyr Tyr Asp Ser Asp Glu Met He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr 
lie Asn Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His 
Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Tip Val Tyr Leu Glu 
Ala Pro Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQID NO.: 45 

atggccaagtactccgacctggaagagggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccat 

caggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 

ttcgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccaccccaacgaggtcaagt 

gctgtgacgagggcacatttggaggcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagcta 

cgccgcctacctaaggagcatcggcgttgatgcctggcgctttgactacgtgaagggctacggagcgtgggtcgtcaaggactggctcaactg 

gtggggcggctgggccgttggcgagtactgggacaccaacgttgatgcactcctcaactgggcctactcgagcggcgccaaggtcttcgactt 

cccgctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcg 

acccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgaggg 

ccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggag 
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gaagcaccgacatagtctactacgataacgatgaactcatcncgtcaggaacggctacggggacaagccggggcttataacctacatcaac^^ 
aggctcgagcaaggccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctgggtag 
acaagtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccgtgtggagctattgc 
ggtgttgggtga 

SEQIDNO.:46 

Met Ala Lys Tyr Ser Asp Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Trp Trp Asp Thr He Arg Ser Lys He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly 
Gly Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin His Trp Leu Trp Ala Ser Asp Glu Ser Tyr 
Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp 
Val Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp 
Ala Leu Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp 
Glu Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser 
Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro 
Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu 
Asn Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val 
Tyr Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr 
He Asn Leu Gly Ser Ser Lys Ala Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His 
Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Tip Val Tyr Leu Glu 
Ala Pro Ala Tyr Asp Pro Ala Asn Gly Gbi Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.: 47 

atggccaagtacaccgagctggaagagggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccat 

caggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcccccggcgagcaagggcatgggcggcgcctattcgatg 

ggctacgacccctacgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 

catgataaacaccgcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaacccctt- 

cgtgaacgactatacctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccaccccaacgaggtcaagtg 

ctgtgacgagggcacatttggaggcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagctac 

gccgcctacctaaggagcatcggcgttgatgcctggcgctttgactacgtgaagggctacggagcgtgggtcgtcaaggactggctcaactgg 

tggggcggttgggccgttggcgagtactgggacaccaacgttgatgcactcctcaactgggcctactcgagcggcgccaaggtcttcgacttc 

ccgctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcga 

cccgttcaaggccgtaacctttgtagcaaaccacgacaccgatataatctggaacaagtaccttgcttatgctttcatcctcacctacgaaggcca 

gcccgtcatattctaccgcgactacgaggagtggctcaacaaggacaggttgaacaacctcatatggatacacgaccacctcgcaggtggaag 

cacgagcatagtttactacgacagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaacctcggct 

cgagcaaggttggaaggtgggtttacgttccgaagttcgcaggcccgtgcatacacgagtacaccggcaatctcggcggctgggtggacaag 

tgggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagctactgcgg 

tgttgggtag 

SEQIDNO.: 48 

Met Ala Lys Tyr Thr Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly 
Gly Gly He Tip Trp Asp Thr He Arg Ser Lys He Pro Glu Tip Tyr Glu Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
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Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly 
Gly Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin His Tip Leu Trp Ala Ser Asp Glu Ser Tyr 
Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Tip Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Tip 
Val Val Lys Asp Trp Leu Asn Tip Tip Gly Gly Trp Ala Val Gly Glu Tyr Tip Asp Thr Asn Val Asp 
Ala Leu Leu Asn Tip Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp 
Glu Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser 
Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp lie He Trp Asn Lys Tyr Leu 
Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu 
Asn Lys Asp Arg Leu Asn Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Ser He Val 
Tyr Tyr Asp Ser Asp Glu Met He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr 
He Asn Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Pro Cys lie His 
Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu 
Ala Pro Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQIDNO.: 53 

ATGGCCAAGTACTCCGAGCTGGAAGAGGGCGGCGTTATAATGCAGGCCTTCTACTGGG 

ACGTCCCAGGTGGAGGAATCTGGTGGGACACCATCAGGAGCAAGATACCGGAGTGGT 

ACGAGGCGGGAATATCCGCCATTTGGATTCCCCCGGCGAGCAAGGGCATGGGCGGCG 

CCTATTCGATGGGCTACGACCCCTACGACTTCTTTGACCTCGGTGAGTACGACCAGAA 

GGGAACGGTAGAGACGCGCTTTGGCTCCAAGCAGGAGCTCGTGAACATGATAAACAC 

GGCCCATGCCTACGGCATAAAGGTCATAGCGGACATCGTCATAAACCACCGCACAGGC 

GGAGACCTCGAGTGGAACCCGTTCGTTGGGGACTACACCTGGACGGACTTCTCAAAGG 

TGGCCTCGGGCAAATATACTGCCAACTACCTCGACTTCCACCCCAACGAGGTCAAGTG 

CTGTGACGAGGGCACATTTGGAGGCTTCCCAGACATAGCCCACGAGAAGAGCTGGGA 

CCAGCACTGGCTCTGGGCGAGCGATGAGAGCTACGCCGCCTACCTAAGGAGCATCGGC 

GTTGATGCCTGGCGCTTCGACTACGTCAAGGGCTACGGAGCGTGGGTCGTCAAGGACT 

GGCTGGACTGGTGGGGAGGCTGGGCCGTCGGGGAGTACTGGGACACAAACGTTOATG 

CACTGCTCAACTGGGCCTACTCGAGCGATGCAAAAGTCTTCGACTTCCCGCTCTACTAC 

AAGATGGATGAGGCCTTTGACAACAAAAACATTCCAGCGCTCGTCTCTGCCCTTCAGA 

ACGGCCAGACTGTTGTCTCCCGCGACCCGTTCAAGGCCGTAACCTTTGTAGCAAACCA 

CGACACCGATATAATCTGGAACAAGTATCCAGCCTACGCGTTCATCCTCACCTACGAG 

GGCCAGCCGACAATATTCTACCGCGACTACGAGGAGTGGCTCAACAAGGATAAGCTCA 

AGAACCTCATCTGGATACATGACAACCTCGCCGGAGGAAGCACTGACATCGTTTACTA 

CGACAACGACGAGCTGATATTCGTGAGAAACGGCTACGGAAGCAAGCCGGGACTGAT 

AACATACATCAACCTCGCCTCAAGCAAAGCCGGAAGGTGGGTCTACGTTCCGAAGTTC 

GCGGGAGCGTGCATCCACGAGTACACCGGCAACCTCGGCGGCTGGGTGGACAAGTGG 

GTGGACTCAAGCGGGTGGGTGTACCTCGAGGCCCCTGCCCACGACCCGGCCAACGGCT 

ATTACGGCTACTCCGTCTGGAGCTACTGCGGTGTTGGCTGA 

SEQIDNO.: 54 

MAKYSELEEGGVIMQAFYWDVPGGGIWWDTIRSKIPEWYEAGISAIWIPPASKGMGGAYS 

MGYDPYDFFDLGEYDQKGTVETRFGSKQELVNMINTAHAYGnCVIADIVINHRTGGDLEW 

NPFVGDYTWTDFSKVASGKYTANYLDFHPNEVKCCDEGTFGGFPDIAHEKSWDQHWLW 

ASDESYAAYLRSIGVDAWRFDYVKGYGAWWKDWLDWWGGWAVGEYWDTNVDALL 

NWAYSSDAKVFDFPLYYKMDEAFDNKNIPALVSALQNGQTVVSRDPFKAVTFVANHDTD 

IIWh«CYPAYAFILTYEGQPTIFYRDYEEWLNKDKLKNLIWIHDNLAGGSTDIVYYDNDELIF 

VRNGYGSKPGLITYINLASSKAGRWVYVPKFAGACfflEYTGNLGGWVDKWVDSSGWVY 

LEAPAHDPANGYYGYSVWSYCGVG 
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atggccaagtacctggagctcgaggagggcggggtcataatgcaggcgttctactggg 

acgtgccttcaggaggaatatggtgggacacaatacggcagaagataccggagtggt 

acgatgccggaatctccgcaatatggattcccccggcgagcaagggcatgggcggcgc 

ctattcgatgggctacgacccctacgacttctttgacctcggtgagtacgaccagaag 

ggaacggtagagacgcgctttggctccaagcaggagctcgtgaacatgataaacacc 

gcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcg 

gtgacctggagtggaaccccttcgtgaacgactatacctggaccgacttctcaaaggt 

cgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgcg 

ggcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggacc 

agtactggctctgggccagccaggagagctacgcggcatatctcaggagcatcggcat 

cgatgcctggcgctttgactacgtgaagggctacggagcgtgggtcgtcaaggactgg 

ctcaactggtggggcggctgggccgttggcgagtactgggacaccaacgttgatgcac 

tcctcaactgggcctactcgagcggcgccaaggtcttcgacttcccgctctactacaag 

atggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacg 

gccagactgttgtctcccgcgacccgttcaaggccgtaacctttgtagcaaaccacga 

caccgatataatctggaacaagtaccttgcttatgctttcatcctcacctacgaaggcc 

agcccgtcatattctaccgcgactacgaggagtggctcaacaaggacaggttgaacaa 

cctcatatggatacacgaccacctcgcaggtggaagcacgagcatagtttactacgac 

agcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataactt 

acatcaacctcggctcgagcaaggttggaaggtgggtttacgttccgaagttcgcagg 

ctcgtgcatacacgagtacaccggcaatctcggcggctgggtggacaagtgggtggac 

tcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggcx:agtacg 

gctactccgtctggagctattgcggtgttggctga 

SEQ ID NO.: 56 

MAKYLEI^EGGVIMQAPYWDVPSGGIWWDTIRQKIPEWYDAGISAIWIPPASKGMGGAYS 

MGYDPYDFFDIXJEYDQKGTN^RFGSKQELVNMIhrrAHAYGMKVIADIVINHRAGGDLE 

WNPFVNDYTWTDFSKVASGKYTANYLDFHPNELHAGDSGTFGGYPDICHDKSWDQYWL 

WASQESYAAYLRSIGIDAWRFDYVKGYGAVm^WIJ^WWGGWAVGEYWDTNVDALL 

NWAYSSGAKVFDFPLYYKMDEAFDNKNIPALVSALQNGQTVVSRDPFKAVTFVANHDTD 

nWKYLAYAnLTYEGQPVIFYlU)YEEWLNKDRLNNLIWIHDHLAGGSTSIVYYDSDEMIF 

VRNGYGSKPGLITYINLGSSKVGRWVYVPKFAGSCIHEYTGNLGGWVDKWVDSSGWVYL 

EAPAHDPANGQYGYSVWSYCGVG 

SEQ ID NO.: 57 

ATGGCCAAGTACCTGGAGCTCGAAGAGAGCGGGGTCATAATGCAGGCGTTCTACTGGG 

ACGTGCCTTCAGGAGGAATATGGTGGGACACAATACGGCAGAAGATACCGGAGTGGT 

ACGATGCCGGAATCTCCGCAATATGGATTCCTCCCGCGAGCAAGGGTATGAGCGGCGG 

CTAltCGATGGGCTACGACCCCTACGATTATTTTGACCTCGGTGAGTACTACCAGAAGG 

GAACGGTGGAAACGAGGTTCGGCTCAAAGCAGGAGCTCATAAACATGATAAACACCG 

CCCACGCCTACGGCATCAAGGTCATCGCAGACATAGTAATCAACCACCGCGCCGGAGG 

AGACCTTGAGTGGAACCCCTTCGTCAATGACTACACCTGGACGGACTTCTCGAAGGTC 

GCTTCCGGCAAGTACACGGCCAACTACCTCGACTTCCACCCCAACGAGGTCAAGTGCT 

GTGACGAGGGCACATTTGGAGGCTTCCCAGACATAGCCCACGAGAAGAGCTGGGACC 

AGCACTGGCTCTGGGCGAGCGATGAGAGCTACGCCGCCTACCTAAGGAGCATCGGCGT 

TGATGCCTGGCGCTTTGACTACGTGAAGGGCTACGGAGCGTGGGTCGTCAAGGACTGG 

CTCAACTGGTGGGGTGGCTGGGCCGTCGGGGAGTACTGGGACACAAACGTTGATGCAC 

TGCTCAACTGGGCCTACTCGAGCGATGCAAAAGTCTTCGACTTCCCGCTCTACTACAAG 

ATGGACGAGGCCTTCGATAACAACAACATTCCCGCCCTGGTGGACGCCCTCAGATACG 

GTCAGACAGTGGTCAGCCGCGACCCGTTCAAGGCTGTGACGTTTGTAGCCAACCACGA 
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TACCGATATAATCTGGAACAAGTACCTTGCTTATGCTTTCATCCTCACCTACGAAGGCC 

AGCCCGTCATATTCTACCGCGACTACGAGGAGTGGCTCAACAAGGACAGGTTGAACAA 

CCTCATATGGATACACGACCACCTCGCAGGTGGAAGCACTGACATCGTTTACTACGAC 

/VACGACGAGCTGATATTCGTGAGAAACGGCTACGGAAGCAAGCCGGGACTGATAACA 

TACATCAACCTCGCCTCAAGCAAAGCCGGAAGGTGGGTCTACGTTCCGAAGTTCGCGG 

GAGCGTGCATCCACGAGTACACCGGCAACCTCGGCGGCTGGGTGGACAAGTGGGTGG 

ACTCAAGCGGGTGGGTGTACCTCGAGGCCCCTGCCCACGACCCGGCCAACGGCTATTA 

CGGCTACTCCGTCTGGAGCTATTGCGGTGTTGGCTGA 

SEQ ID NO.: 58 

MAKYLEI^ESGVIMQAFYWDWSGGIWWDTIRQKIPEWYDAGISAIWIPPASK^ 

MGYDPYDYFDLGEYYQKGWETRFGSKQELINMmXAHAYGIKVIADIXONHRAGGDLEW 

NPFVNDYTWTDFSKVASGKYTANYLDFHP>ffiVKCCDEGTFGGFPDIAHEKSWDQHWLW 

ASDESYAAYLRSIG\^DAWRFDYVKGYGAWVVKDWLNWWGGWAVGEYWDT^^ 

NWAYSSDAKVFDFPLYYKMDEAFDNNNIPALVDALRYGQTWSRDPFKAV^ 

IIWNKYLAYAJ?ILTYEGQPVIFYRDYEEWLh«a)RLNNLIWIHDHL^ 

VRNGYGSKPGLITYINLASSKAGRWVYVPKFAGACIHEYTGNLGGWVDKWVDSSGWVY 

LEAPAHDPANGYYGYSVWSYCGVG 

SEQ ID NO.: 59 

atggccaagtacctggagctcgaagagggcggggtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaat 

acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcctcccgcgagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacat 

gataaacaccgcccacgcctacggcatcaaggtcatcgcagacatagtaatcaaccaccgcgccggaggagaccttgagtggaaccccttcg 

tcaatgactacacctggacggacttctcgaaggtcgcttccggcaagtacacggccaactacctcgacttccacccgaacgagctccatgcgg 

gcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgcg 

gcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggtg^^ 

gaggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcgccct 

ctactacaagatggacgaggccttcgataacaacaacattcccgccctggtggacgccctcagatacggtcagacagtggtcagccgcgacc 

cgncaaggctgtgacgtttgtagccaaccacgataccgatataamggaacaagtacccggcctacgccncatcctca 

ccgacgatattctaccgcgactacgaggagtggctcaacaaggacaggctcaagaacctcatctggatacacgaccacctcgccggtggaag 

cactgacatcgtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacctcgc 

gtcaagcaaagccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacc^^ 

agtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactc^^ 

gttgggtga 

SEQ ID NO.: 60 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val lie Met Gin Ala Phe Tyr Tip Asp Val Pro Set 
Gly Gly He Tip Trp Asp Thr He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val Val 
Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Val 
Leu Asn Trp Ala Tyr Ser Ser Gly. Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu Ala 
Phe Asp Asn Asn Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg Asp 
Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp lie He Trp Asn Lys Tyr Pro Ala Tyr 
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Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys 
Asp Arg Leu Lys Asn Leu lie Tip He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr 
Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He Asn 
Leu Ala Ser Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu Tyr 
Thr Gly Asn Leu Gly Gly Tip Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro 
Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDN0.:61 

atggccaagtactccgagctgaaaaagggcggggtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaat 

acggcagaagataccggagtggtacgaggcgggaatatccgccatttggattcctcccgcgagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacat 

gataaacaccgcccacgcctacggcatcaaggtcatcgcagacatagtaatcaaccaccgcgccggaggagaccttgagtggaaccccttcg 

tcaatgactacacctggacggacttctcgaaggtcgcttccggcaagtacacggccaactacctcaacttccacccgaacgagctccatgcgg 

gcgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgcg 

gcatatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctacggagcg^gggtcgtcaaggactggctggact^^ 

gggaggctgggccgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgacttccc 

gctctactacaagatggatgaggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcgacc 

cgttcaaggccgtaaccWgtagcaaaccatgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggccag 

ccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggaggaag 

caccgacatagtctactacgataacgatgaactcatcttcgtcaggaacggctacggggacaagccggggcttataacctacatcaacctaggc 

tcgagcaaggccggaaggtgggtctacgttccgaagttcgcgggagcgtgcatccacgagtacaccggcaacctcggcggctgggtggaca 

agtgggtggactcaagcgggtgggtgtacctcgaggcccctgcccacgacccggccaacggctattacggctactccgtct^^ 

ggggtgggctga 

SEQIDNO.:62 

Met Ala Lys Tyr Ser Glu Leu Lys Lys Gly Gly Val He Met Gin Ala Phe Tyr Tip Asp Val Pro Ser 
Gly Gly He Tip Trp Asp Thr He Arg Gin Lys He Pro Glu Tip Tyr Glu Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asn Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Tip Ala Ser Gin Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Tip Val Val 
Lys Asp Trp Leu Asp Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Tip Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu 
Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Tip Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Tip He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr 
Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Gly Ser Ser Lys Ala Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQIDNO.:63 

atggccaagtacctggagctcgaagagggcggggtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaat 
acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcccccggcgagcaagggcatgggcggcgcctattcgatg 
ggctacgaccccUcgacttctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaa 
catgataaacacggcccatgcctacggcataaaggccatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccg 
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ttcgttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttcxaccccaacgag^^ 

gctgtgacgagggcacatttggaggcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagcta 

cgccgcctacctaaggagcatcggcgttgatgcctggcgctttgactacgtgaagggctacggagcgtgggtcgtcaaggactggctcaactg 

gtggggcggctgggccgttggcgagjactgggacaccaacgttgatgcactcctcaactgggcctactcgagcggcgc^ 

cccgctctactacaagatggacgcggcctttgacaacaagaacattcccgcactcgtcgaggccctcaagaacgggggcacagtcgtcagcc 

gcgacccgtttaaggccgtaaccttcgttgcaaaccacgacaccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgag 

ggccagccgacaatattctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcg 

aggaagcaccgacatagtctactacgataacgatgaactcatcttcgtcaggaacggctacggggacaagccggggcttataacctacatcaa 

cctaggctggagcaaggccggaaggtgggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctggg 

tagacaagtacgtctactcaagcggctgggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggc^ 

ctgcggggtggggtga 

SEQ ID NO,: 64 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Ser 
Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Tip Tyr Asp Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp 
Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val Asn 
Met He Asn Thr Ala His Ala Tyr Gly He Lys Ala He Ala Asp He Val He Asn His Arg Ala Gly Gly 
Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys 
Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly 
Gly Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin His Trp Leu Tip Ala Ser Asp Glu Ser Tyr 
Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Tip 
Val Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Tip Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp 
Ala Leu Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp 
Ala Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn Gly Gly Thr Val Val Ser 
Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro 
Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu 
Asn Lys Asp Lys Leu Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val 
Tyr Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr 
He Asn Leu Gly Trp Ser Lys Ala Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His 
Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu 
Ala Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 65 

atggccaagtactccgagctggaagaaggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtggggcaccat 

caggagcaagataccggagtggtacgaggcgggaatatccgccatttggattcctcccgcgagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacat 

gataaacaccgcccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaaccccttcgt 

gaacgactatacctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgcggg 

cgattccggaacatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgcgg 

catatctcaggagcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggtgggg 

^ggctgggcggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcgccc^ 

tactacaagatggacgaggccttcgataacaacaacattcccgccctggtggacgccctcagatacggtcagacagtggtcagccgcgaccc 

gttcaaggctgtgacgtttgtagccaaccacgataccgatataatttggaacaagtacccggcctacgccttcatcctcacctacgagggccagc 

cgacgatattctaccgcgactacgaggagtggctcaacaaggacaggctcaagaacctcatctggatacacgaccacctcgccggtggaagc 

acgagcatagtttactacgacagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaacctcggctc 

gagcaaggttggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtggacaagt 

gggtggactcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagctattgcggt 

gttggctga 
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SEQIDNO.:66 

Met Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Tip Asp Val Pro Gly 
Gly Gly He Tip Tip Gly Thr He Arg Ser Lys He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala He Tip He 
Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp Leu 
Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met He 
Asn Thr Ala His Ala Tyr Gly Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Tip Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr 
Pro Asp He Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Tip Ala Ser Gin Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val Val 
Lys Asp Trp Leu Asn Trp Tip Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Val 
Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu Ala 
Phe Asp Asn Asn Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg Asp 
Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Tip Asn Lys Tyr Pro Ala Tyr 
Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys 
Asp Arg Leu Lys Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Ser He Val Tyr Tyr 
Asp Ser Asp Glu Met He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He Asn 
Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr 
Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Gl^ Ala Pro 
Ala His Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.:67 

atggccaagtacctggagctcgaagagggcggggtcataatgcaggcgttctactgggacgtgccttcgggaggaatatggtgggacacaat 

acggcagaagataccggagtggtacgatgccggaatctccgcaatatggattcctcccgcgagcaagggtatgagcggcggctattcgatgg 

gctacgacccctacgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacat 

gataaacacggcccatgcctacggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccgttc 

gttggggactacacctggacggacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccaccccaacgaggtcaag^^ 

gtgacgagggcacatttggaggcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagctacg 

ccgcctacctaaggagcatcggcgttgatgcctggcgcttcgactacgtcaagggctacggagcgtgggtcgtcaaggactggctggactggt 

ggggaggctgggcxgtcggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcga 

ccgctctactacaagatggacgaggccttcgataacaacaacattcccgccctggtggacgccctcagatacggtcagacagtggtcagccgc 

gacccgttcaaggctgtgacg^ttgtagccaaccacgataccgatataatctggaacaagtatccagcctacgcgttcatcctcacctacgaggg 



gaagcacgagcatagtttactacgacagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaacct 
cggctcgagcaaggttggaaggtgggtctacgttccgaagttcgcgggagcgtgcatccacgagtacaccggcaacctcggcggctgggtg 
gacaagtgggtggactcaagcgggtgggtgtacctcgaggcccctgcccacgacccggccaacggctattacggctactccgtctggagc^ 
ctgcgtggtgggctga 



SEQroNO.:68 

Met Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Tip Asp Val Pro Ser 
Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala He Trp 
He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Tyr Tyr Gb Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met 
He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe Gly Gly 
Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin His Trp Leu Trp Ala Ser Asp Glu Ser Tyr Ala 
Ala Tyr Leu Arg Ser He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val 
Val Lys Asp Trp Leu Asp Tip Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala 
Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu 
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Ala Phe Asp Asn Asn Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg 
Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala 
Tyr Ala Phe lie Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn 
Lys Asp Lys Leu Lys Asn Leu He Tip He His Asp Asn Leu Ala Gly Gly Ser Thr Ser He Val Tyr 
Tyr Asp Ser Asp Glu Met He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He 
Asn Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala 
Pro Ala His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Val Val Gly 

SEQ ID NO.: 73 

atggctctggaagagggcgggcttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccatagcccagaagat 

acccgactgggcgagcgccgggatttcggcaatatggattcctcccgcgagtaagggcatgagcggcggctattcgatgggctacgacccct 

acgatttcttcgacctcggtgagtactaccagaagggaagcgttgagacccgcttcggatcaaaagaggagcttgtgaacatgataaacaccgc 

ccatgctcacaacatgaaggtcatagcggacatagtcatcaaccaccgcgccggcggcgacctggagtggaatcctttcaccaacagctacac 

ctggaccgatttctcgaaggtcgcgtcgggcaagtacacggccaactacctcgacttccacccgaacgagcttcacgcgggcgattccggaa 

catttggaggctatcccgacatatgccacgacaagagctgggaccagcactggctctgggccagcaacgaaagctacgccgcctacctccgg 

agcatcggcatcgacgcctggcgcttcgactacgtcaagggctacgctccctgggtcgttaagaactggctgaaccggtggggcggctgggc 

ggttggagagtactgggacaccaacgtcgatgcactcctgagctgggcctacgacagcggtgctaaagtcttcgacttcccgctctactacaag 

atggacgaggccttcgataacaacaacatccccgccctcgtggacgccctcaagaacggaggcacggtcgtcagccgcgacccgttcaaag 

ccgtgaccttcgttgccaaccacgataccaacataatctggaacaagtatccggcctacgccttcatcctcacctatgagggacagccggcaat 

attctaccgcgactacgaggagtggctcaacaaggacaggctcaggaacctcatctggatacacgaccacctcgcgggaggaagcacagac 

atcatctactacgacagcgacgagcttatcttcgtgagaaacggctacggggacaagccgggactgataacctacatcaacctcggctcaagc 

aaggccggaaggtgggtctacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaacctcggcggctggattgacaagtgggt 

tgactcaagcggtcgggtctaccttgaggcccccgcccacgacccggccaacggccagtacggctactccgtatggagctactgcggtgttg 

ggtga 

SEQ ID NO.: 74 

Met Ala Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Tip Asp Val Pro Gly Gly Gly He Trp 
Tip Asp Thr He Ala Gin Lys He Pro Asp Tip Ala Ser Ala Gly He Ser Ala He Trp He Pro Pro Ala 
Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp Leu Gly Glu Tyr 
Tyr Gin Lys Gly Ser Val Glu Thr Arg Phe Gly Ser Lys Glu Glu Leu Val Asn Met He Asn Thr Ala 
His Ala His Asn Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp Leu Glu Tip 
Asn Pro Phe Thr Asn Ser Tyr Thr Tvp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala Asn 
Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr Pro Asp He 
Cys His Asp Lys Ser Trp Asp Gin His Trp Leu Trp Ala Ser Asn Glu Ser Tyr Ala Ala Tyr Leu Arg 
Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val Val Lys Asn Tip 
Leu Asn Arg Trp Gly Gly Trp Ala Val Gly Glu Tyr Tip Asp Thr Asn Val Asp Ala Leu Leu Ser Trp 
Ala Tyr Asp Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu Ala Phe Asp 
Asn Asn Asn He Pro Ala Leu Val Asp Ala Leu Lys Asn Gly Gly Thr Val Val Ser Arg Asp Pro Phe 
Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asn He He Tip Asn Lys Tyr Pro Ala Tyr Ala Phe He 
Leu Thr Tyr Glu Gly Gin Pro Ala He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys Asp Arg Leu 
Arg Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He He Tyr Tyr Asp Ser Asp Glu 
Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr Tyr He Asn Leu Gly Ser Ser 
Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr Thr Gly Asn Leu 
Gly Gly Trp He Asp Lys Trp Val Asp Ser Ser Gly Arg Val Tyr Leu Glu Ala Pro Ala His Asp Pro 
Ala Asn Gly Gin Tyr Gly Tyr Ser Val Tip Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 75 

atggctctggaagagggcgggcttataatgcaggcattctactgggacgtccccatgggaggaatctggtgggacacgatagcccagaagat 
acccgactgggcaagcgccgggatttcggcgatatggattccccccgcgagcaagggtatgagcggcggctattcgatgggctacgacccct 
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acgattattngacctcggtgagtactaccagaagggaacggtggaaacaagattcggctcaaagcaggagctcataaacatgataaacaccg 

cccacgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggcgatctggagtggaaccccttcgtgaacgactata 

cctggaccgacttctcgaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccacgcgggcgattccgga 

acatttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgcggcctatctcag 

gagcatcggcatcgacgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcagggactggctgaactggtggggaggctggg 

cagttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcgccctctactacaag 

atggacgaggccttcgataacaacaacattcccgccctggtggacgccctcagatacggccagacagtggtcagccgcgacccgttcaaggc 

tgtgacgtttgtagccaaccacgataccgacataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggccagccgacaatat 

tctaccgcgactacgaggagtggctcaacaaggacaagctcaagaacctcatctggatacatgacaacctcgccggagggagcactgacatc 

gtttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacctcggctcaagcaaa 

gccggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaacctcggcggctgggtggacaagtgggtgga 

ctcaagcggctgggtttacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtttggagctattgcggtgttgggtga 

SEQIDNO.:76 

Met Ala Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met Gly Gly He Tip 
Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Tip He Pro Pro Ala 
Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp Leu Gly Glu Tyr 
Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met He Asn Thr Ala 
His Ala Tyr Gly Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp Leu Glu Trp 
Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala Asn 
Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr Pro Asp He 
Cys His Asp Lys Ser Tip Asp Gin Tyr Tip Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala Tyr Leu Arg 
Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val Val Arg Asp Trp 
Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Val Leu Asn Trp 
Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu Ala Phe Asp Asn 
Asn Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg Asp Pro Phe Lys 
Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala Tyr Ala Phe He 
Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leii Asn Lys Asp Lys Leu 
Lys Asn Leu He Trp He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr Asp Asn Asp 
Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He Asn Leu Gly Ser Ser 
Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr Thr Gly Asn Leu 
Gly Gly Tip Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro Ala His Asp Pro 
Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQIDNO.:77 

atggctctggaagagggcgggctcataatgcaggccttctactgggacgtccccatgggaggaatctggtgggacacgatagcccagaagat 

acccgactgggcaagcgccgggatttcggcgatatggatccctcccgcgagcaagggtatgagcggcggctattcgatgggctacgacccct 

acgattattttgacctcggtgagtactaccagaagggaacggtggaaacgaggttcggctcaaagcaggagctcataaacatgataaacaccg 

ccc^cgcctatggcatgaaggtaatagccgatatagtcatcaaccaccgcgccggcggtgacctggagtggaaccccttcgtgaac^^ 

cctggaccgacttctcaaaggtcgcgtcgggtaaatacacggccaactacctcgacttccacccgaacgagctccatgcgggcgattccggaa 

catttggaggctatcccgacatatgccacgacaagagctgggaccagtactggctctgggccagccaggagagctacgcggcatatctcagg 

agcatcggcatcgatgcctggcgcttcgactacgtcaagggctatgctccctgggtcgtcaaggactggctgaactggtggggaggctgggc 

ggttggagagtactgggacaccaacgtcgacgctgttctcaactgggcatactcgagcggtgccaaggtctttgacttcgccctctactacaaga 

tggacgaggccttcgataacaacaacattcccgccctggtggacgccctcagatacggtcagacagtggtcagccgcgacccgttcaaggct 

gtgacgtttgtagccaaccacgataccgacataatctggaacaagtatccagcctacgcgttcatcctcacctacgagggccagccgacaatatt 

ctaccgcgactacgaggagtggctcaacaaggataagctcaagaacctcatctggatacatgacaacctcgccggagggagcactgacatcg 

tttactacgacaacgacgagctgatattcgtgagaaacggctacggaagcaagccgggactgataacatacatcaacctcgcctcaagcaaag 

ccggaaggtgggtttacgttccgaagttcgcaggctcgtgcatacacgagtacaccggcaatctcggcggctgggtggacaagtgggtggac 

tcaagcggctgggtctacctcgaggctcctgcccacgacccggccaacggccagtacggctactccgtctggagctactgcggtgttgggtga 
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SEQ ID NO.: 78 

Met Ala Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Met Gly Gly He Tip 
Trp Asp Thr He Ala Gin Lys He Pro Asp Trp Ala Ser Ala Gly He Ser Ala He Trp He Pro Pro Ala 
Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp Leu Gly Glu Tyr 
Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met He Asn Thr Ala 
His Ala Tyr Gly Met Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp Leu Glu Trp 
Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala Asn 
Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly Thr Phe Gly Gly Tyr Pro Asp He 
Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala Tyr Leu Arg 
Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Ala Pro Trp Val Val Lys Asp Trp 
Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Val Leu Asn Trp 
Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Ala Leu Tyr Tyr Lys Met Asp Glu Ala Phe Asp Asn 
Asn Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg Asp Pro Phe Lys 
Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala Tyr Ala Phe He 
Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys Asp Lys Leu 
Lys Asn Leu He Tip He His Asp Asn Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr Asp Asn Asp 
Glu Leu He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr Tyr He Asn Leu Ala Ser Ser 
Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr Thr Gly Asn Leu 
Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro Ala His Asp Pro 
Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 79 

atgaagcctgcgaaactcctcgtctttgtgctcgtagtctctatcctcgcggggctctacgcccagcccgcgggggcggccaagtacctggagc 

tcgaagagggcggcgtcataatgcaggcgttctactgggacgtgccttcaggaggaatatggtgggacacaatacggcagaagataccgga 

gtggtacgatgccggaatctccgcaatatggattcccccggcgagcaagggcatgggcggcgcctattcgatgggctacgacccctacgactt 

ctttgacctcggtgagtacgaccagaagggaacggtagagacgcgctttggctccaagcaggagctcgtgaacatgataaacaccgcccacg 

cctacggcatcaaggtcatcgcagacatagtaatcaaccaccgcgccggaggagaccttgagtggaaccccttcgtcaatgactacacctgga 

cggacttctcgaaggtcgcttccggcaagtacacggccaactacctcgacttccaccccaacgaggtcaagtgctgcgacgagggcacctttg 

gagggttcccggacatagcccacgagaagagctgggaccagtactggctctgggcgagcaacgagagctacgccgcctacctcaggagca 

tcggcgttgacgcatggcgcttcgactacgtcaagggctacggagcgtgggtcgtcaaggactggctggactggtggggaggctgggccgt 

cggggagtactgggacacaaacgttgatgcactgctcaactgggcctactcgagcgatgcaaaagtcttcgacttcccgctctactacaagatg 

gacgcggcctttgacaacaagaacattcccgcactcgtcgaggccctcaagaacgggggcacagtcgtcagccgcgacccgtttaaggccgt 

aaccttcgttgcaaaccacgacacggacataatttggaacaagtacccggcctacgccttcatcctcacctacgagggccagccgacgatattc 

taccgcgactacgaggagtggctcaacaaggacaggctcaagaacctcatctggatacacgaccacctcgccggtggaagcaccgacatag 

tctactacgataacgatgaactcatcttcgtcaggaacggctacggggacaagccggggcttataacctacatcaacctaggctcgagcaagg 

ccgggaggtgggtctacgttccgaagttcgcgggagcgtgcatccacgagtacaccggcaacctcggcggctgggtggacaagtgg^ 

ctcaagcgggtgggtgtacctcgaggcccctgcccacgacccggccaacggctattacggctactccgtctggagctactgcggggtg 

ga 

SEQ ID NO.: 80 

Met Lys Pro Ala Lys Leu Leu Val Phe Val Leu Val Val Ser He Leu Ala Gly Leu Tyr Ala Gin Pro 
Ala Gly Ala Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Tip Asp Val 
Pro Ser Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala 
He Trp He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe 
Phe Asp Leu Gly Glu Tyr Asp Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu Val 
Asn Met He Asn Thr Ala His Ala Tyr Gly He Lys Val lie Ala Asp He Val He Asn His Arg Ala Gly 
Gly Asp Leu Glu Trp Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly 
Lys Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe 
Gly Gly Phe Pro Asp He Ala His Glu Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Asn Glu Ser 
Tyr Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala 
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Trp Val Val Lys Asp Trp Leu Asp Tip Trp Gly Gly Tip Ala Val Gly Glu Tyr Trp Asp Thr Asn Val 
Asp Ala Leu Leu Asn Trp Ala Tyr Ser Ser Asp Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met 
Asp Ala Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn Gly Gly Thr Val Val 
Ser Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr 
Pro Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr Glu Glu Trp 
Leu Asn Lys Asp Arg Leu Lys Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He 
Val Tyr Tyr Asp Asn Asp Glu Leu He Phe Val Arg Asn Gly Tyr Gly Asp Lys Pro Gly Leu He Thr 
Tyr He Asn Leu Gly Ser Ser Lys Ala Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He 
His Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Lys Trp Val Asp Ser Ser Gly Tip Val Tyr Leu 
Glu Ala Pro Ala His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 81 

atgaagaagtttgtcgccctgttcataaccatgtttttcgtagtgagcatggcagtcgttgcacagccagctagcgccgcaaagtattccgagctc 

gaagaaggcggcgttataatgcaggccttctactgggacgtcccaggtggaggaatctggtgggacaccatcaggagcaagataccggagt 

ggtacgaggcgggaatatccgccatttggattccgccagccagcaaggggatgagcggcggttactcgatgggctacgatccctacgatttctt 

tgacctcggcgagtacaaccagaagggaaccatcgaaacgcgctttggctctaaacaggagctcatcaatatgataaacacggcccatgccta 

cggcataaaggtcatagcggacatcgtcataaaccaccgcgcaggcggagacctcgagtggaacccgttcgttggggactacacctggacg 

gacttctcaaaggtggcctcgggcaaatatactgccaactacctcgacttccaccccaacgaggtcaagtgctgtgacgagggcacatttggag 

gcttcccagacatagcccacgagaagagctgggaccagcactggctctgggcgagcgatgagagctacgccgcctacctaaggagcatcg 

gcgttgatgcctggcgctttgactacgtgaagggctacggagcgtgggtcgtcaaggactggctcaactggtggggcggctgggccgttggc 

gagtactgggacaccaacgttgatgcactcctcaactgggcctactcgagcggcgccaaggtcttcgacttcccgctctactacaagatggatg 

aggcctttgacaacaaaaacattccagcgctcgtctctgcccttcagaacggccagactgttgtctcccgcgacccgttcaaggccgtaaccttt 

gtagcaaaccacgacaccgatataatctggaacaagtaccttgcttatgctttcatcctcacctacgaaggccagcccgtcatattctaccgcgac 

tacgaggagtggctcaacaaggacaggttgaacaacctcatatggatacacgaccacctcgcaggtggaagcacgagcatagtctactacga 

cagcgacgagatgatcttcgtgaggaacggctatggaagcaagcctggccttataacttacatcaacctcggctcgagcaaggttggaaggtg 

ggtttatgtgccgaagttcgcgggcgcgtgcatccacgagtatactggtaacctcggaggctgggtagacaagtacgtctactcaagcggctg 

ggtctatctcgaagctccagcttacgaccctgccaacgggcagtatggctactccgtgtggagctattgcggtgttgggtga 

SEQ ID NO.: 82 

Met Lys Lys Phe Val Ala Leu Phe He Thr Met Phe Phe Val Val Ser Met Ala Val Val Ala Gin Pro 
Ala Ser Ala Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val 
Pro Gly Gly Gly He Trp Trp Asp Thr He Arg Ser Lys He Pro Glu Trp Tyr Glu Ala Gly He Ser Ala 
lie Trp He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe 
Phe Asp Leu Gly Glu Tyr Asn Gin Lys Gly Thr He Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He 
Asn Met He Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly 
Gly Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly 
Lys Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr Phe 
Gly Gly Phe Pro Asp He Ala His Glu Lys Ser Tip Asp Gin His Trp Leu Trp Ala Ser Asp Glu Ser 
Tyr Ala Ala Tyr Leu Arg Ser He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala 
Trp Val Val Lys Asp Trp Leu Asn Tip Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val 
Asp Ala Leu Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met 
Asp Glu Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Ser Ala Leu Gin Asn Gly Gin Thr Val Val 
Ser Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr 
Leu Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Val lie Phe Tyr Arg Asp Tyr Glu Glu Tip 
Leu Asn Lys Asp Arg Leu Asn Asn Leu He Tip He His Asp His Leu Ala Gly Gly Ser Thr Ser He 
Val Tyr Tyr Asp Ser Asp Glu Met He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu He Thr 
Tyr He Asn Leu Gly Ser Ser Lys Val Gly Arg Tip Val Tyr Val Pro Lys Phe Ala Gly Ala Cys He 
His Glu Tyr Thr Gly Asn Leu Gly Gly Tip Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu 
Glu Ala Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 
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SEQIDNO.:83 

atggctctggaagacggcgggctcataatgcaggccttctactgggatgttcctggaggaggaatctggtgggacacaatagctcaaaagata 

cccgaatgggcaagtgcaggaatctcagcgatatggattccaccagcgagtaagggcatgagcggtggttattccatgggctacgatccctac 

gatttctttgacctcggcgagtactatcagaaggggacagttgagacgcgcttcggctcaaaggaagaactggtgaacatgataaacaccgca 

cactcctacggcataaaggtgatagcagacatagtcataaaccaccgcgccggtggagaccttgagtggaaccccttcgtgaacgactatacct 

ggacagacttctcaaaagtcgcctccggtaaatatacggccaactaccttgacttccacccaaacgagcttcactgttgtgatgaaggtacctttg 

gaggataccctgatatatgtcacgacaaaagctgggaccagtactggctctgggcgagcagcgaaagctacgctgcctacctcaggagcata 

ggggttgacgcctggcgtttcgactacgtcaagggctacggagcatgggttgttaacgactggctcagctggtggggaggctgggccgttgga 

gagtactgggacacgaacgttgatgcactcctcaactgggcatacagcagcggcgccaaggtctttgacttcccgctctactacaagatggacg 

aagccttcgacaacaccaacatcccggcattagtggatgcactcagatacggccagacagtggtcagccgcgatcccttcaaggcggtaacttt 

cgttgccaaccacgatacagatataatctggaacaagtatccggcttatgcattcatccttacctatgagggacagcctgttatattctaccgcgac 

tacgaggagtggctcaacaagga^agcttaacaacctcatctggatacacgatcaccUgctggagggagtactgacattgtttactacga 

cgacgagcttatctttgtgagaaacggctatggcaccaaaccaggactgataacctatatcaacctcggctcaagcaaagttggaaggtgggtc 

tacgttccaaagttcgccggttcatgcatccacgagtacaccggcaacctcggcggttggatagacaagtacgtctcctccagcggctgggtct 

atcttgaggccccagcccacgacccggcgaacggctactacggctactccgtatggagctactgcggggttgggtga 

SEQ ID NO. : 84 

Met Ala Leu Glu Asp Gly Gly Leu He Met Gin Ala Phe Tyr Tip Asp Val Pro Gly Gly Gly He Trp 

Trp Asp Thr lie Ala Gin Lys lie Pro Glu Trp Ala Ser Ala Gly He Ser Ala He Trp He Pro Pro Ala 
Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp Leu Gly Glu Tyr 
Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Glu Glu Leu Val Asn Met He Asn Thr Ala 
His Ser Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp Leu Glu Trp Asn 
Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala Asn Tyr 
Leu Asp Phe His Pro Asn Glu Leu His Cys Cys Asp Glu Gly Thr Phe Gly Gly Tyr Pro Asp He Cys 
His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Ser Glu Ser Tyr Ala Ala Tyr Leu Arg Ser 
He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val Asn Asp Trp Leu 
Ser Trp Trp Gly Gly Tip Ala Val Gly Glu Tyr Tip Asp Thr Asn Val Asp Ala Leu Leu Asn Trp Ala 
Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu Ala Phe Asp Asn Thr 
Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg Asp Pro Phe Lys Ala 
Val Thr Phe Val Ala Asn His Asp Thr Asp He He Tip Asn Lys Tyr Pro Ala Tyr Ala Phe He Leu 
Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys Asp Lys Leu Asn 
Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr Asp Ser Asp Glu 
Leu He Phe Val Arg Asn Gly Tyr Gly Thr Lys Pro Gly Leu lie Thr Tyr He Asn Leu Gly Ser Ser 
Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr Thr Gly Asn Leu 
Gly Gly Trp He Asp Lys Tyr Val Ser Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro Ala His Asp Pro 
Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQ ID NO.: 85 

atggctctggaagagggcgggcttataatgcaggcattctattgggacgtcccaggtggaggaatctggtgggacaccatagcccagaagata 

cccgaatgggcaagtgcaggaatctcagcgatatggattccaccagcgagtaagggaatgagcggtggttattccatgggctacgatccctac 

gatttctttgacctcggcgagtactatcagaaggggacagttgagacgcgcttcggctcaaaggaagaactggtgaacatgataaacaccgca 

cactcctacggcataaaggtgatagcggacatagtcataaaccaccgcgccggtggaggcctcgagtggaaccccttcgtgaacgactatacc 

tggacagacttctcaaaagtcgcctccggtaaatatacagccaactaccttgacttccacccaaacgagcttcactgttgtgatgaaggtacctttg 

gaggataccctgatatatgtcacgacaaaagctgggaccagtactggctctgggcgagcagcgaaagctacgctgcctacctcaggagcata 

ggggttgacgcctggtgtttcgactacgtcaagggctacggagcctgggttgttaacgactggctcagctggtggggaggctgggccgttgga 

gagtactgggacactaacgttgatgcactcctcaactgggcatacaacagcggcgccaaggtctttgacttcccgctctactacaagatggacg 

aagccttcgacaataccaacatccccgctttggtttacgccctcaagaatggcgggacagtggtcagccgcgacccattcaaggcggtaacttt 

cgttgccaaccacgatacagatataatctggaacaagtatccggcttatgcattcatccttacctatgagggacagcctgttatattctaccgcgac 

tacgaggagtggctcaacaaggataagcttaacaacctcatctggatacacgatcaccttgctggagggagtactgacattgtttactacgacag 

cgacgagcttatctttgtgagaaacggctatggcaccaaaccaggactgataacctatatcaacctcggctcaagcaaagctggaaggtgggtc 
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tacgttccaaagjtcgccggttcatgcatccacgagtacaccggcagcctcggcggttggatagacaagjacgtcte^^ 
accngaggccccggcccacgacccggccaatggccagtatggctactccgtctggagctattgcggggttgggtg 

SEQIDNO.:86 

Met Ala Leu Glu Glu Gly Gly Leu He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly Gly Gly He Tip 
Trp Asp Thr He Ala Gin Lys He Pro Glu Trp Ala Ser Ala Gly He Ser Ala He Trp He Pro Pro Ala 
Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp Leu Gly Glu Tyr 
Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Glu Glu Leu Val Asn Met He Asn Thr Ala 
His Ser Tyr Gly He Lys. Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Gly Leu Glu Trp Asn 
Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala Asn Tyr 
Leu Asp Phe His Pro Asn Glu Leu His Cys Cys Asp Glu Gly Thr Phe Gly Gly Tyr Pro Asp He Cys 
His Asp Lys Ser Trp Asp Gin Tyr Tip Leu Trp Ala Ser Ser Glu Ser Tyr Ala Ala Tyr Leu Arg Ser 
He Gly Val Asp Ala Trp Cys Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val Asn Asp Trp Leu 
Ser Trp Tip Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Leu Leu Asn Trp Ala 
Tyr Asn Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu Ala Phe Asp Asn 
Thr Asn He Pro Ala Leu Val Tyr Ala Leu Lys Asn Gly Gly Thr Val Val Ser Arg Asp Pro Phe Lys 
Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala Tyr Ala Phe He 
Leu Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys Asp Lys Leu 
Asn Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr Asp Ser Asp 
Glu Leu He Phe Val Arg Asn Gly Tyr Gly Thr Lys Pro Gly Leu He Thr Tyr He Asn Leu Gly Ser 
Ser Lys Ala Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr Thr Gly Ser 
Leu Gly Gly Trp He Asp Lys Tyr Val Ser Ser Ser Gly Tip Val Tyr Leu Glu Ala Pro Ala His Asp 
Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQ ID NO: 87 

atgttcctgctcgcgtttttgctcactgcctcgctgttctgcccaacaggacagcccgccaaggctgccgcaccgtttaacggcaccatgatgca 

gtattttgaatggtacttgccggatgatggcacgttatggaccaaagtggccaatgaagccaacaacmtccagccttggcatcaccgctcttt^ 

gctgccgcccgcttacaaaggaacaagccgcagcgacgtagggtacggagtatacgacttgtatgacctcggcgaattcaatcaaaaaggga 

ccgtccgcacaaaatacggaacaaaagctcaatatcttcaagccattcaagccgcccacgccgctggaatgcaagtgtacgccgatgtcgtgtt 

cgaccataaaggcggcgctgacggcacggaatgggtggacgccgtcgaagtcaatccgtccgaccgcaaccaagaaatctcgggcacctat 

caaatccaagcatggacgaaatttgattttcccgggcggggcaacacctactccagctttaagtggcgctggtaccattttgacggcgttgattgg 

gacgaaagccgaaaattgagccgcatttacaaattccgcggcatcggcaaagcgtgggattgggaagtagacacggaaaacggaaactatg 

actacttaatgtatgccgaccttgatatggatcatcccgaagtcgtgaccgagctgaaaaactgggggaaatggtatgtcaacacaacg 

gatgggttccggcttgatgccgtcaagcatattaagttcagtttttttcctgattggttgtcgtatgtgcgttctcagactggcaagccgctatt^^ 

tcggggaatattggagctatgacatcaacaagttgcacaattacattacgaaaacagacggaacgatgtctttgtttgatgccccgttacacaaca 

aattttataccgcttccaaatcagggggcgcatttgatatgcgcacgttaatgaccaatactctcatgaaagatcaaccga^^ 

cgttgataatcatgacaccgaacccggccaagcgctgcagtcatgggtcgacccatggttcaaaccgttggcttacgcctttattctaactcggc 

aggaaggatacccgtgcgtcttttatggtgactattatggcattccacaatataacattccttcgctgaaaagcaaaatcgatccgctcctcatcgc 

gcgcagggattatgcttacggaacgcaacatgattatcttgatcactccgacatcatcgggtggacaagggaaggggtcactgaaaaaccagg 

atccgggctggccgcactgatcaccgatgggccgggaggaagcaaatggatgtactgttggcaaacaacacgctggaaaagtgttcto^^ 

SEQ ID NO: 88 

Met Phe Leu Leu Ala Phe Leu Leu Thr Ala Ser Leu Phe Cys Pro Thr Gly Gin Pro Ala Lys Ala Ala 
Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr 
Lys Val Ala Asn Glu Ala Asn Asn Leu Ser Ser Leu Gly He Thr Ala Leu Trp Leu Pro Pro Ala Tyr 
Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp Leu Gly Glu Phe Asn Gin 
Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Tyr Leu Gin Ala He Gin Ala Ala His Ala 
Ala Gly Met Gin Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly Thr Glu Trp Val 
Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gin Glu He Ser Gly Thr Tyr Gin He Gin Ala Trp 
Thr Lys Phe Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His Phe Asp 
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Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg He Tyr Lys Phe Arg Gly lie Gly Lys Ala Tip 
Asp Tip Glu Val Asp Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp 
His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn Thr Thr Asn He Asp Gly 
Phe Arg Leu Asp Ala Val Lys His He Lys Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser 
Gin Thr Gly Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp He Asn Lys Leu His Asn Tyr 
He Thr Lys Thr Asp Gly Thr Met Ser Leu Phe Asp Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser 
Lys Ser Gly Gly Ala Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gin Pro Thr 
Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gin Ala Leu Gin Ser Trp Val Asp Pro 
Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr Gly 
Asp Tyr Tyr Gly He Pro Gin Tyr Asn He Pro Ser Leu Lys Ser Lys He Asp Pro Leu Leu He Ala Arg 
Arg Asp Tyr Ala Tyr Gly Thr Gin His Asp Tyr Leu Asp His Ser Asp He He Gly Trp Thr Arg Glu 
Gly Val Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu He Thr Asp Gly Pro Gly Gly Ser Lys Tip 
Met Tyr Cys Trp Gin Thr Thr Arg Trp Lys Ser Val Leu 

SEQ ID NO: 89 

atgaaagaagcggttgtgtatcaaattttcccggatcggttctttaatggcaacccttcaaatgataacagcaagcagcaggcacgcggggcgc 

agccgattgagcatcgcgattggtcggatttgcccgataatccgcgcctgaaagggacgagcggctacgatggcgacggtgaatggtcgaat 

gactttttcggcggagacatcgccggaattgaacaaaagttggattatttgcagtcgcttggagtgaacacgatttacttaaatccgatcgccaatg 

cgccatcgaaccataaatatgatgcgagcaattacaaagaattggatccgatgttcggttccccggaagaattccaatcgtttgtgcaggcgcttg 

cgaaccgggggatgcatctcatcttagacggggtgttcaaccacgtatccgacgattcgatttactttgaccgctaccaccgctatccgaccgtc 

ggtgcgtatgaatattgggaagcggtttacgatttgatgaatgaaaaaggattgagcgaggaagaagcgcggaaacaagtggaagagaagttc 

aaacaagagggacagacgttcagcccgtatgggtttcatctttggttcaatattgaaaacaaaaaagtcaatggccattatcaataccaatcatggt 

ggggctatgacagtctgccggagtttaagtcggtgacgggggaaaaagtgccgcatccgagtgaattgaacaacgatgcgctcgcgaattac 

attttccgtgaatcggattcggtggcgaaaagctggattgccctcggcgcctccggctggcggttggatgtggccaatgaggtggatccggcgt 

tttggcgcgagtttcgccaagaattgcttcaagggtcgtacggccgcggtccgacgttaaaagagggggagcagccgctcattttaggggaaa 

tttgggatgacgcatcgaaatattttctaggcgaccagtacgattccgtgatgaactaccggttccgcggggcggtgcttgactttttgaaaaacg 

gaaatgcagaagaggcggacaagcggctgacggccataagggaagactacccaagtgaagcgttttatgcgctgatgaacttaatcggttcg 

catgacacggcgcgggcggtctttctgcttgggaacggaacggattcatccgagcgggcggagcttgatccgaattataatgaggaacttggg 

aaaaagcggctcaagctggcggtgattttgcagatgggatacccgggagcgccgacgatttattacggcgatgaagcgggagtaacaggctc 

aaaagacccagacaaccgccgcacgtatccgtggggcaaagaagatcaaaatctgttgtcccattatcagaaagtggggcacattcgccagc 

accatcaatcgttgttggcccatggcgacatcaagacggtgtatgcgcaaggggatgtatacgtatttgcccgccaatacgggcgtgaagcgg 

cgctcattgccatcaaccgcggcaatgaggacaagacggtggcgcttgacgtcgcttcgttgcttccgaacggcaccgtgcttacggatgagtt 

gcatgatggcggggaagctacggtcgctggcggaacgttgacggtcacgattccggccctggatggacggatgatgtttgggacggtgacg 

gcggaaatgccggcagcagtcagcaatttgcaggcgagcgcttcggatggctgcgtgacgttaacgtgggaaggaaatgcatcgagatacc 

gaatttacgagtccacgttaaaaggtgccggttatacgatggtgcaagagacggaaacaacttcggccacgatcggttcgttgacgaacggaa 

cagcctattactttgccgttgcggcggtcgatgaaaacgggaatgaatcaccgaaggtcgaaacgaatcgcgtcgttcctcattacccgctgac 

gagcgacaatgtccagttcgtgacaacgttaagcgatgccacactggatttgtcaaagccgcagcaagtggatgtccatgtcaacatcgacaat 

gtgacaagcaaaggagcagctgatgggttgcaagcggtgttgcaagtgaaaggcccgcatgacgaaacatggaaagaatacagagcggctt 

accaaggacaagacggcgacgccaacgtgttccgagctgccttcactccgctcgccgcagggacgtatacgtatcgttatgcgctgacgacc 

aaccttggcgaggagtggatgtatacagaagagaagcaagtgacgtttgcggcagacaacagcgaccaaatagcgccagcagacgccatcg 

agctgcggcagcctgcggttgaatcgggacaagtgaatttatcatggacgtttgttgggaaaaaagatggggatgcttatttgttagccatcgag 

cgcaacggtgatatcgtgcatacaaccacttcgatcggcgattcatttacagactacgatgtcgaaaacggcaccgagtacacgtatgttgtcaa 

gttgtatgaccgcgccggcaatgttgtggcgtcaaacacggtcaaggtgacgccggacattgtgatggtgaaagtgatttttaaagtgagagcg 

ccggattacacaccgttggatgcccgaattacgattccgaacagcttgaacggctggaacacaggggcctgggagatgtcgcgcaacggtgc 

ggtgacgcccgattggcaatttaccgtcgaggtgcaggaaggggaaacgatcacctataagtatgtgaaaggcggatcgtgggatcaagagg 

ggttggccgaccatacgcgtgaggacgacaacgatgatgacgtgagctactacggctatgggacgattggcaccgacttgaaagtgacggtc 

cacaatgaaggaaacaatacgatgattgtgcaagaccgcattttgcgctggatcgatatgccggtcgtcatcgaagaggtgcaaaaacaagga 

agtcaagtgacgatcaagggcaatgccattaaaaacggtgttttgacgatcaatggcgagcgggtgccgattgatggccggatggcattctcgt 

acacgtttgcgccggccagccatcaaaaagaagtgttgatccatatcgaaccatcggccgaaagcaaaacagccattttcaacaacgacggcg 
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gagcgaUgcgaaaaacacaaaagattacgtgctgaatttagaaacgaagcaattcaaaaagcttctcgagagtacttctagagcggccgc^^ 
gcccatcgattttccacccgggtggggtaccaggta 

SEQIDNO:90 

Met Lys Glu Ala Val Val Tyr Gin He Phe Pro Asp Arg Phe Phe Asn Gly Asn Pro Ser Asn Asp Asn 
Ser Lys Gin Gin Ala Arg Gly Ala Gin Pro He Glu His Arg Asp Trp Ser Asp Leu Pro Asp Asn Pro 
Arg Leu Lys Gly Thr Ser Gly Tyr Asp Gly Asp Gly Glu Trp Ser Asn Asp Phe Phe Gly Gly Asp lie 
Ala Gly He Glu Gin Lys Leu Asp Tyr Leu Gin Ser Leu Gly Val Asn Thr He Tyr Leu Asn Pro He 
Ala Asn Ala Pro Ser Asn His Lys Tyr Asp Ala Ser Asn Tyr Lys Glu Leu Asp Pro Met Phe Gly Ser 
Pro Glu Glu Phe Gin Ser Phe Val Gin Ala Leu Ala Asn Arg Gly Met His Leu He Leu Asp Gly Val 
Phe Asn His Val Ser Asp Asp Ser He Tyr Phe Asp Arg Tyr His Arg Tyr Pro Thr Val Gly Ala Tyr 
Glu Tyr Tip Glu Ala Val Tyr Asp Leu Met Asn Glu Lys Gly Leu Ser Glu Glu Glu Ala Arg Lys Gin 
Val Glu Glu Lys Phe Lys Gin Glu Gly Gin Thr Phe Ser Pro Tyr Gly Phe His Leu Trp Phe Asn He 
Glu Asn Lys Lys Val Asn Gly His Tyr Gin Tyr Gin Ser Tip Trp Gly Tyr Asp Ser Leu Pro Glu Phe 
Lys Ser Val Thr Gly Glu Lys Val Pro His Pro Ser Glu Leu Asn Asn Asp Ala Leu Ala Asn Tyr He 
Phe Arg Glu Ser Asp Ser Val Ala Lys Ser Trp He Ala Leu Gly Ala Ser Gly Trp Arg Leu Asp Val 
Ala Asn Glu Val Asp Pro Ala Phe Trp Arg Glu Phe Arg Gin Glu Leu Leu Gin Gly Ser Tyr Gly Arg 
Gly Pro Thr Leu Lys Glu Gly Glu Gin Pro Leu He Leu Gly Glu He Trp Asp Asp Ala Ser Lys Tyr 
Phe Leu Gly Asp Gin Tyr Asp Ser Val Met Asn Tyr Arg Phe Arg Gly Ala Val Leu Asp Phe Leu 
Lys Asn Gly Asn Ala Glu Glu Ala Asp Lys Arg Leu Thr Ala He Arg Glu Asp Tyr Pro Ser Glu Ala 
Phe Tyr Ala Leu Met Asn Leu He Gly Ser His Asp Thr Ala Arg Ala Val Phe Leu Leu Gly Asn Gly 
Thr Asp Ser Ser Glu Arg Ala Glu Leu Asp Pro Asn Tyr Asn Glu Glu Leu Gly Lys Lys Arg Leu 
Lys Leu Ala Val He Leu Gin Met Gly Tyr Pro Gly Ala Pro Thr He Tyr Tyr Gly Asp Glu Ala Gly 
Val Thr Gly Ser Lys Asp Pro Asp Asn Arg Arg Thr Tyr Pro Trp Gly Lys Glu Asp Gin Asn Leu 
Leu Ser His Tyr Gin Lys Val Gly His He Arg Gin His His Gin Ser Leu Leu Ala His Gly Asp lie 
Lys Thr Val Tyr Ala Gin Gly Asp Val Tyr Val Phe Ala Arg Gin Tyr Gly Arg Glu Ala Ala Leu He 
Ala He Asn Arg Gly Asn Glu Asp Lys TTir Val Ala Leu Asp Val Ala Ser Leu Leu Pro Asn Gly Thr 
Val Leu Thr Asp Glu Leu His Asp Gly Gly Glu Ala Thr Val Ala Gly Gly Thr Leu Thr Val Thr He 
Pro Ala Leu Asp Gly Arg Met Met Phe Gly Thr Val Thr Ala Glu Met Pro Ala Ala Val Ser Asn Leu 
Gin Ala Ser Ala Ser Asp Gly Cys Val Thr Leu Thr Tip Glu Gly Asn Ala Ser Arg Tyr Arg He Tyr 
Glu Ser Thr Leu Lys Gly Ala Gly Tyr Thr Met Val Gin Glu Thr Glu Thr Thr Ser Ala Thr He Gly 
Ser Leu Thr Asn Gly Thr Ala Tyr Tyr Phe Ala Val Ala Ala Val Asp Glu Asn Gly Asn Glu Ser Pro 
Lys Val Glu Thr Asn Arg Val Val Pro His Tyr Pro Leu Thr Ser Asp Asn Val Gin Phe Val Thr Thr 
Leu Ser Asp Ala Thr Leu Asp Leu Ser Lys Pro Gin Gin Val Asp Val His Val Asn He Asp Asn Val 
Thr Ser Lys Gly Ala Ala Asp Gly Leu Gin Ala Val Leu Gin Val Lys Gly Pro His Asp Glu Thr Trp 
Lys Glu Tyr Arg Ala Ala Tyr Gin Gly Gin Asp Gly Asp Ala Asn Val Phe Arg Ala Ala Phe Thr Pro 
Leu Ala Ala Gly Thr Tyr Thr Tyr Arg Tyr Ala Leu Thr Thr Asn Leu Gly Glu Glu Trp Met Tyr Thr 
Glu Glu Lys Gin Val Thr Phe Ala Ala Asp Asn Ser Asp Gin He Ala Pro Ala Asp Ala He Glu Leu 
Arg Gin Pro Ala Val Glu Ser Gly Gin Val Asn Leu Ser Trp Thr Phe Val Gly Lys Lys Asp Gly Asp 
Ala Tyr Leu Leu Ala He Glu Arg Asn Gly Asp He Val His Thr Thr Thr Ser He Gly Asp Ser Phe Thr 
Asp Tyr Asp Val Glu Asn Gly Thr Glu Tyr Thr Tyr Val Val Lys Leu Tyr Asp Arg Ala Gly Asn 
Val Val Ala Ser Asn Thr Val Lys Val Thr Pro Asp He Val Met Val Lys Val He Phe Lys Val Arg 
Ala Pro Asp Tyr Thr Pro Leu Asp Ala Arg He Thr He Pro Asn Ser Leu Asn Gly Trp Asn Thr Gly 
Ala Trp Glu Met Ser Arg Asn Gly Ala Val Thr Pro Asp Trp Gin Phe Thr Val Glu Val Gin Glu Gly 
Glu Thr He Thr Tyr Lys Tyr Val Lys Gly Gly Ser Trp Asp Gin Glu Gly Leu Ala Asp His Thr Arg 
Glu Asp Asp Asn Asp Asp Asp Val Ser Tyr Tyr Gly Tyr Gly Thr He Gly Thr Asp Leu Lys Val Thr 
Val His Asn Glu Gly Asn Asn Thr Met He Val Gto Asp Arg He Leu Arg Trp He Asp Met Pro Val 
Val He Glu Glu Val Gin Lys Gin Gly Ser Gin Val Thr He Lys Gly Asn Ala He Lys Asn Gly Val 
Leu Thr He Asn Gly Glu Arg Val Pro He Asp Gly Arg Met Ala Phe Ser Tyr Thr Phe Ala Pro Ala 
Ser His Gin Lys Glu Val Leu lie His He Glu Pro Ser Ala Glu Ser Lys Thr Ala He Phe Asn Asn Asp 
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Gly Gly Ala lie Ala Lys Asn Thr Lys Asp Tyr Val Leu Asn Leu Glu Thr Lys Gin Phe Lys Lys Leu 
Leu Glu Ser Thr Ser Arg Ala Ala Ala Gly Pro Ser He Phe His Pro Gly Gly Val Pro Gly 

SEQIDN0:91 

gtgctaacgtttcaccgcatcattcgaaaaggatggatgttcctgctcgcgtttttgctcactgcctcgctgttctgcccaacaggacagcccgcca 

aggctgccgcaccgtttaacggcaccatgatgcagtattttgaatggtacttgccggatgatggcacgttatggaccaaagtggccaatgaagc 

caacaacttatccagccttggcatcaccgctctttggctgccgcccgcttataaaggaacaagccgcagcgacgtagggtacggagtatacga 

cttgtatgacctcggcgaattcaatcaaaaagggaccgtccgcacaaaatacggaacaaaagctcaatatcttcaagccattcaagccgcccac 

gccgctggaatgcaagtgtacgccgatgtcgtgttcgaccataaaggcggcgccgacggcacggaatgggtggacgccgtcgaagtcaatc 

cgtccgaccgcaaccaagaaatctcgggcacctatcaaatccaagcatggacgaaatttgattttcccgggcggggcaacacctactccagctt 

taagtggcgctggtaccattttgacggcgttgattgggacgaaagccgaaaattgagccgcatttacaaattccgcggcatcggcaaagcgtgg 

gattgggaagtagacacggaaaacggaaactatgactacttaatgtatgccgacttggacatggaccatcctgaagtggttacggaactgaaaa 

actggggcaaatggtatgtcaacacaacgaacangatgggnccggcngatgccgtcaagcatattaagncagttttmcctgaUggU 

atgtgcgUctcagactggcaagccgctatttaccgtcggggaataUggagctatgacatcaacaagttgcacaamcattacgaaaacaaacg 

gaacgatgtcmgmgatgccccgttacacaacaaattttataccgcnccaaatcagggggcgcamgatatgcgcacgttaatg^ 

ctcatgaaagatcaaccgacattggccgtcaccttcgttgataatcatgacaccgaacccggccaagcgctgcagtcatgggtcgacccatggt 

tcaaaccgUggcttacgcctttanctaactcggcaggaaggatacccgtgcgtctmatggtgactattatggcatcccacaatato^^ 

cgctgaaaagcaaaatcgatccgctcctcatcgcgcgcagggattatgcttacggaacgcaacatgattatcngatcactccg 

tggacaagggaaggcgtcactgaaaaaccaggatccggactggccgcactgatcaccgatgggccgggaggaagcaaatggatgtacgtt 

ggcaaacaacacgccggaaaagtgttctatgaccttaccggcaaccggagtgacaccgtcaccatcaacagtgatggatggggagaattcaa 

agtcaatggcggttcggmcggmgggttcctagaaaaacgaccgtctctaccatcgcttggccgatcacaacccgaccgtggactggtgaatt 

cgtccgttggaccgaaccacggttggtggcatggccttga 

SEQroNO: 92 

Val Leu Thr Phe His Arg He He Arg Lys Gly Trp Met Phe Leu Leu Ala Phe Leu Leu Thr Ala Ser 
Leu Phe Cys Pro Thr Gly Gin Pro Ala Lys Ala Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe 
Glu Trp Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn Leu Ser Ser 
Leu Gly He Thr Ala Leu Tip Leu Pro Pro Ala Tyr Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly 
Val Tyr Asp Leu Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr 
Lys Ala Gin Tyr Leu Gin Ala He Gin Ala Ala His Ala Ala Gly Met Gin Val Tyr Ala Asp Val Val 
Phe Asp His Lys Gly Gly Ala Asp Gly Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg 
Asn Gin Glu He Ser Gly Thr Tyr Gin He Gin Ala Trp Thr Lys Phe Asp Phe Pro Gly Arg Gly Asn 
Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu 
Ser Arg He Tyr Lys Phe Arg Gly He Gly Lys Ala Trp Asp Tip Glu Val Asp Thr Glu Asn Gly Asn 
Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His Pro Glu Val Val Thr Glu Leu Lys Asn 
Trp Gly Lys Trp Tyr Val Asn Thr Thr Asn He Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys 
Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gin Thr Gly Lys Pro Leu Phe Thr Val Gly 
Glu Tyr Trp Ser Tyr Asp He Asn Lys Leu His Asn Tyr He Thr Lys Thr Asn Gly Thr Met Ser Leu 
Phe Asp Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala Phe Asp Met Arg Thr 
Leu Met Thr Asn Thr Leu Met Lys Asp Gin Pro Thr Leu Ala Val Thr Phe Val Asp Asn His Asp 
Thr Glu Pro Gly Gin Ala Leu Gin Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe He 
Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp Tyr Tyr Gly He Pro Gin Tyr Asn He 
Pro Ser Leu Lys Ser Lys He Asp Pr9 Leu Leu He Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin His 
Asp Tyr Leu Asp His Ser Asp He He Gly Trp Thr Arg Glu Gly Val Thr Glu Lys Pro Gly Ser Gly 
Leu Ala Ala Leu He Thr Asp Gly Pro Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gin His Ala Gly 
Lys Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr He Asn Ser Asp Gly Trp Gly Glu 
Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp Val Pro Arg Lys Thr Thr Val Ser Thr He Ala Tip 
Pro He Thr Thr Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val Ala Tip Pro 
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SEQIDNO:93 

atgaaatcgtUgcattcatgcctatccttttttatgcaaacgatttcatcagtgaaagggaaggaggaggaaaaatggggaagaatatg^ 

agaUcacgtatttttcaatcncttattgftcgUcagctgttUcatttagtgcaaccgctagcgccaatggaacggtgaacagtagtcct^ 

tggaaacgaagtcacgmctatatggaggaacaggaaacgagcagtctgtgttactggcaggctcctttaatgatt^ 

agaagangcactaacaaaaggcgacaataacgtctggtctgtcacgcaaacacttcaagatgggacatatacgtataagmgngtagatggtc 

aatgggtggcggatccgcttaacccgaatcaagtagacgacggttacggcggccgtaatagtgtcgttgttgtcgggacaccggtgcaacaa 

aacggacagtgacgcttgttggtaacttacaagacgaattaggtcatacgagcgaatgggatccgaaagcgacagctacagtgatgaaaaagg 

aagggaacgggttatatacgtttacaggtacacttccagccggaacgtacgagtataaaattgcgattaatggcagctgggacgaa 

gtcggcggccgcgatggcgggaatattaagctgctattaaatgaacaaacaacggttacattttattacaacgacagaacgcatgcgattgcg 

attcgacttggtatgcaccaattctaaaagaaaagcagccgcggctcgttggaacgattttaccagctattggttatgaaacagacgtgaacggtt 

ggacgccgcaaacatcaacggcgngngtcagatgatgatmgaUccatttatacgtttaaggcgcgtgtgccaaaagggacatatg^ 

agtagUcttgggaatgattggacatatgaaaattatccacaagataatgccaaattaaatgtgcUgaagaaacgacaattacctttUct^ 

gaaaacgaaagtagtgtataccgattacaatccaagcggUcggatggtatcgtccaaaaagaccgmgaagcataatacgtgggattcgttgta 

tcgccaaccgmggtgcggtgaaagctgggacagaagtgacccttcgtttatcagcgaaaaaaggtgatttgacaaaagcggatgtatatgtaa 

aaaatacgacaaccggcacagcgaaactatattcgatgaaaaaagccggtgttcttggcgaagaagaatattgggaagcgacattcacaccgg 

atgtgaaaggagtatacggttataaatttattgcggtagatgctggaacgaaagcagaatacggggaagatacacaagaagggcagtgggga 

aaagcagtagataaaaatgcagagctgttccaattaacggtgtacgacccatcctaccaaacaccggattggatgaaagaagcagttgtatatca 

aattttccctgatccaaag 

SEQIDNO:94 

Met Lys Ser Phe Ala Phe Met Pro He Leu Phe Tyr Ala Asn Asp Phe He Ser Glu Arg Glu Gly Gly 
Gly Lys Met Gly Lys Asn Met Arg Arg Arg Phe Thr Tyr Phe Ser He Phe Leu Leu Phe Val Gin Leu 
Phe Ser Phe Ser Ala Thr Ala Ser Ala Asn Gly Thr Val Asn Ser Ser Pro Val Val Asn Gly Asn Glu 
Val Thr Phe Leu Tyr Gly Gly Thr Gly Asn Glu Gin Ser Val Leu Leu Ala Gly Ser Phe Asn Asp Trp 
Gin Lys Asp Gly Asp Lys Lys He Ala Leu Thr Lys Gly Asp Asn Asn Val Trp Ser Val Thr Gin Thr 
Leu Gin Asp Gly Thr Tyr Thr Tyr Lys Phe Val Val Asp Gly Gin Trp Val Ala Asp Pro Leu Asn Pro 
Asn Gin Val Asp Asp Gly Tyr Gly Gly Arg Asn Ser Val Val Val Val Gly Thr Pro Val Gin Gin Glu 
Arg Thr Val Thr Leu Val Gly Asn Leu Gin Asp Glu Leu Gly His Thr Ser Glu Trp Asp Pro Lys Ala 
Thr Ala Thr Val Met Lys Lys Glu Gly Asn Gly Leu Tyr Thr Phe Thr Gly Thr Leu Pro Ala Gly Thr 
Tyr Glu Tyr Lys He Ala He Asn Gly Ser Trp Asp Glu Asn Tyr Gly Val Gly Gly Arg Asp Gly Gly 
Asn He Lys Leu Leu Leu Asn Glu Gin Thr Thr Val Thr Phe Tyr Tyr Asn Asp Arg Thr His Ala He 
Ala Asp Ser Thr Trp Tyr Ala Pro He Leu Lys Glu Lys Gin Pro Arg Leu Val Gly Thr He Leu Pro 
Ala He Gly Tyr Glu Thr Asp Val Asn Gly Trp Thr Pro Gin Thr Ser Thr Ala Leu Leu Ser Asp Asp 
Asp Phe Asp Ser He Tyr Thr Phe Lys Ala Arg Val Pro Lys Gly Thr Tyr Glu Tyr Lys Val Val Leu 
Gly Asn Asp Trp Thr Tyr Glu Asn Tyr Pro Gin Asp Asn Ala Lys Leu Asn Val Leu Glu Glu Thr 
Thr He Thr Phe Phe Phe Asn Ala Lys Thr Lys Val Val Tyr Thr Asp Tyr Asn Pro Ser Gly Ser Asp 
Gly He Val Gin Lys Asp Arg Leu Lys His Asn Thr Trp Asp Ser Leu Tyr Arg Gin Pro Phe Gly Ala 
Val Lys Ala Gly Thr Glu Val Thr Leu Arg Leu Ser Ala Lys Lys Gly Asp Leu Thr Lys Ala Asp Val 
Tyr Val Lys Asn Thr Thr Thr Gly Thr Ala Lys Leu Tyr Ser Met Lys Lys Ala Gly Val Leu Gly Glu 
Glu Glu Tyr Trp Glu Ala Thr Phe Thr Pro Asp Val Lys Gly Val Tyr Gly Tyr Lys Phe He Ala Val 
Asp Ala Gly Thr Lys Ala Glu Tyr Gly Glu Asp Thr Gin Glu Gly Gin Trp Gly Lys Ala Val Asp Lys 
Asn Ala Glu Leu Phe Gin Leu Thr Val Tyr Asp Pro Ser Tyr Gin Thr Pro Asp Tip Met Lys Glu Ala 
Val Val Tyr Gin He Phe Pro Asp Pro Lys 

SEQID NO: 95 

atgtatacactattcatccgttcatattttgatactgatggtgatggtgtaggagactttagtggagttgctgaaaaggtagattatctaaaatctcttg 

gagtagatacagtctggtttttaccatttaataaaagtaaatcttatcatggatatgatgttgaagattactatgatgtagaaccagattatggaacact 

acaagatcttgataatatgataaaagttctaaatgaaaatggaataaaggtagtaatggatcttgttgttaatcatacgtcggatacacatccatggtt 

tcttgatgcagttgaaaatactactaattctccatattggaactattacattatgagcttggatgagcctcaaaataagaatcattggcattataaggtt 

aattcaaaaggacaaactgtgtggtattttggattgtttgattcatcaatgccggaccttaattacgacaaccctaaagtaatggatgaagtgaaaaa 
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aataatagattmgggcagatatgggagjagatggamagattagatgcagcaaaacattattatggamgactggagcgatggaa^^^ 

tcagcaagcgttgcaaaagagatagaagactatataaaagataaactaggggaaaatgcaatagttgtgagtgaggtttacgatggagattcaa 

atgttcttttaaaamgctccaatgcctgtgtttaattttagttttatgtacaamgagaggaaatmgaagggagagate 

agttgggttgattcctcgngtataatttaaatgttmcatmccatmttgatagtcatgatcngaca^^ 

aggg^^gatgtaatatctgccacaaaacaatamgctagttaatgctttactactctcattaacaggcatgccaactatttactatggtg 

gacttaggggatggaagtggcattcagaaccatgggatatacctgtgcgtgagccaatgcaatggtataaggatcaaaaagggaacggtcaaa 

cttanggacaaaagagtmacgaaggtanactgaaggaagtgctaatgaagatggagcaatetacgatgatccagatgatgga^^ 

aagaacaagaaaatggatattctattttaaacttttttaaagaatttatcaacttacgaaaagattatccggcacttgcttttggaagtactacgattga 

gagagattggaaaaacttgtatgtmgaaaaagtcgtataacttccaggatgttcttgtattaattaaccttgatccaacgtattcaaat^^ 

gUccagaagggtataaatgggtgtggtatgcatttmgatggtgacaactatgaamggagcaaaagatgaaatgatttta^ 

gacgataaatccaaggcaaatttatatatttgtaaagtaa 

SEQ ID NO: 96 

Met Tyr Thr Leu Phe He Arg Ser Tyr Phe Asp Thr Asp Gly Asp Gly Val Gly Asp Phe Ser Gly Val 
Ala Glu Lys Val Asp Tyr Leu Lys Ser Leu Gly Val Asp Thr Val Trp Phe Leu Pro Phe Asn Lys Ser 
Lys Ser Tyr His Gly Tyr Asp Val Glu Asp Tyr Tyr Asp Val Glu Pro Asp Tyr Gly Thr Leu Gin Asp 
Leu Asp Asn Met He Lys Val Leu Asn Glu Asn Gly He Lys Val Val Met Asp Leu Val Val Asn His 
Thr Ser Asp Thr His Pro Tip Phe Leu Asp Ala Val Glu Asn Thr Thr Asn Ser Pro Tyr Trp Asn Tyr 
Tyr He Met Ser Leu Asp Glu Pro Gin Asn Lys Asn His Trp His Tyr Lys Val Asn Ser Lys Gly Gin 
Thr Val Trp Tyr Phe Gly Leu Phe Asp Ser Ser Met Pro Asp Leu Asn Tyr Asp Asn Pro Lys Val Met 
Asp Glu Val Lys Lys He He Asp Phe Tip Ala Asp Met Gly Val Asp Gly Phe Arg Leu Asp Ala Ala 
Lys His Tyr Tyr Gly Phe Asp Trp Ser Asp Gly He Glu Gin Ser Ala Ser Val Ala Lys Glu He Glu 
Asp Tyr He Lys Asp Lys Leu Gly Glu Asn Ala He Val Val Ser Glu Val Tyr Asp Gly Asp Ser Asn 
Val Leu Leu Lys Phe Ala Pro Met Pro Val Phe Asn Phe Ser Phe Met Tyr Asn Leu Arg Gly Asn 
Phe Glu Gly Arg Asp Asn Leu He Ser Asp Ser He Ser Trp Val Asp Ser Ser Leu Tyr Asn Leu Asn 
Val Phe His Phe Pro Phe He Asp Ser His Asp Leu Asp Arg Phe He Ser Glu Leu Val Asp Ser Lys 
Tyr Gin Gly Asp Val He Ser Ala Thr Lys Gin Tyr Leu Leu Val Asn Ala Leu Leu Leu Ser Leu Thr 
Gly Met Pro Thr He Tyr Tyr Gly Asp Glu He Gly Leu Arg Gly Trp Lys Trp His Ser Glu Pro Trp 
Asp He Pro Val Arg Glu Pro Met Gin Tip Tyr Lys Asp Gin Lys Gly Asn Gly Gin Thr Tyr Tip Thr 
Lys Glu Phe Tyr Glu Gly He Thr Glu Gly Ser Ala Asn Glu Asp Gly Ala He Tyr Asp Asp Pro Asp 
Asp Gly Val Ser Val Glu Glu Gin Glu Asn Gly Tyr Ser He Leu Asn Phe Phe Lys Glu Phe He Asn 
Leu Arg Lys Asp Tyr Pro Ala Leu Ala Phe Gly Ser Thr Thr He Glu Arg Asp Trp Lys Asn Leu Tyr 
Val Leu Lys Lys Ser Tyr Asn Phe Gin Asp Val Leu Val Leu He Asn Leu Asp Pro Thr Tyr Ser Asn 
Thr Tyr Glu Val Pro Glu Gly Tyr Lys Trp Val Tip Tyr Ala Phe Phe Asp Gly Asp Asn Tyr Glu Phe 
Gly Ala Lys Asp Glu Met He Leu Gin Asn Thr Ser Trp Thr He Asn Pro Arg Gin He Tyr He Phe Val 
Lys 

SEQ ID NO: 97 

atgaggaagaagatgtcgcattcaagamacttttcttttgatcttagcactttttattttcttctccggttgtatttcagaagttaaaagc^^ 

ctactaaattcaaagcaaaaggtccttgtaaaagtaaatgttaatacgccatttattgagaatgctactactaatacgtggagtgtttcaaaagaatct 

tttattgattatcttagtaaagtgattattactgttaaggatgtaaatgatcagattgtatttactaaggaaacaacgaacaaaacaaatatttattttgaa 

attgaacttcttcctggaacttatacatttgaggtaaaaggatatgaggaagatttagttatattttcaggggaaaaagttaatcagatcatagatgag 

aaaaataatattgttaatgtcgaaactttttttgttaatggaatagttaggacaataattgaagttgacgatattatttataaaaattatgatattacatcgg 

caacgttgatcttcaaaaaagatacagcacaagaagattatgaagaggtacctgtaacacttacaggtacttccactttaattaataaagaattatat 

cctggtatgtggactgtaaaatttgaagttgatcttaaatcaaaggatgcaagtatgttaccagaaaaagttcatcttgaaaatgaatttagcataga 

agtgcttccagcaaagacaaaaagtttaacatttaatgtagtctttgatacagaggttaatgaaccgaaattagtagttgtatttccgcaaattgagtt 

gccttttgtggatcctgtaacaaatttaagtggagagataaatgaattagaagggaatctttcaatgaattgggactattcagatccaaatgcagaat 

tttatgtgtataaagaattagaggaacaaggagaatatttgtatgaatttgttggaaaaacacgcgagaaaagttatacaatagaaaattttaccaag 

caagaattcgataaatttagtggaatcgctattaatgtttatgccaacggtaaagagagtggattagttgttctaaaaaaagaaaatattaaacttata 

gatttagaaagtgttgacagtataagtgctacttataacgttgatacgaatgagcttaagttggattggaattataccaattcaagtgttactttt^ 
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ttttgaaaaaaggtataaatagcaatgaatacgaaataamctcaactaacacaaaaUctttUcaacagaattcacaggcaggcaat^ 

ttgagaaaattgcgattagagtagttgctaatggamgaaagtaagattaatgagatttcaagagatgatataactataacatcattgaatcttcctct 

tacatcgtctactatgtatacactattcatccgttcatatmgatactgatggtgatggtgtaggagactttagtgga^^ 

taaaatctcUggagtagatacagtctggtttttaccatttaataaaagtaaatcttatcatggatatgatgttgaaganacm^ 

tatggaacactacaagatcUgataatatgataaaagttctaaatgaaaatggaataaaggtagtaatggatcngngttaatcatacgtcgga^^ 

acatccatggmcttgatgcagttgaaaatactactaanctccatattggaactattacattatgagcUggatgagcctcaaaataagaato 

cattataaggttaattcaaaaggacaaactgtgtggtatmggattgWgattcatcaatgccggaccttaattacgacaaccctaa^ 

gaagtgaaaaaaataatagattmgggcagatatgggagtagatggatttagattagatgcagcaaaacattattatggamgactggagcgat^ 

gaattgaacagtcagcaagcgttgcaaaagagatagaagactatataaaagataaactaggggaaaatgcaatagttgtgagtgaggtttacga 

tggagattcaaatgttcttttaaaamgctccaatgcctgtgtttaattttagttttatgtacaamgagaggaaatmgaagggagagato 

tcagactctattagttgggttgattcctcgttgtataatttaaatgttmcatmccatttattgategtcatgatcttgacaga 

atagtaaatatcagggagatgtaatatctgccacaaaacaatamgctagttaatgctttactactctcattaacaggcatgc^ 

tgatgaaataggacttaggggatggaagtggcattcagaaccatgggatatacctgtgcgtgagccaatgcaatggtataaggatcaaaaagg 

gaacggtcaaacttattggacaaaagagttttacgaaggtattactgaaggaag^gctaatgaagatggagcaatatacgatgatccagatgat^ 

gagtatctgtagaagaacaagaaaatggatattctatmaaacttttttaaagaatttatcaacttacgaaaagattatccg^^ 

actacgattgagagagattggaaaaacttgtatgtmgaaaaagtcgtataacttccaggatgttcttgtattaattaaccttgatccaacgtate 

atacatacgaagttccagaagggtataaatgggtgtggtatgcatttmgatggtgacaactatgaamggagcaaaagatgaaat^^ 

aatacaagttggacgataaatccaaggcaaatttatatatttgtaaagtaa 

SEQIDNO: 98 

Met Arg Lys Lys Met Ser His Ser Arg Phe Thr Phe Leu Leu Tie Leu Ala Leu Phe lie Phe Phe Ser 
Gly Cys He Ser Glu Val Lys Ser Glu Ser Gin Leu Leu Asn Ser Lys Gin Lys Val Leu Val Lys Val 
Asn Val Asn Thr Pro Phe He Glu Asn Ala Thr Thr Asn Thr Trp Ser Val Ser Lys Glu Ser Phe He 
Asp Tyr Leu Ser Lys Val He He Thr Val Lys Asp Val Asn Asp Gin He Val Phe Thr Lys Glu Thr 
Thr Asn Lys Thr Asn He Tyr Phe Glu He Glu Leu Leu Pro Gly Thr Tyr Thr Phe Glu Val Lys Gly 
Tyr Glu Glu Asp Leu Val He Phe Ser Gly Glu Lys Val Asn Gin He He Asp Glu Lys Asn Asn He 
Val Asn Val Glu Thr Phe Phe Val Asn Gly He Val Arg Thr He He Glu Val Asp Asp He He Tyr Lys 
Asn Tyr Asp He Thr Ser Ala Thr Leu He Phe Lys Lys Asp Thr Ala Gin Glu Asp Tyr Glu Glu Val 
Pro Val Thr Leu Thr Gly Thr Ser Thr Leu He Asn Lys Glu Leu Tyr Pro Gly Met Trp Thr Val Lys 
Phe Glu Val Asp Leu Lys Ser Lys Asp Ala Ser Met Leu Pro Glu Lys Val His Leu Glu Asn Glu Phe 
Ser He Glu Val Leu Pro Ala Lys Thr Lys Ser Leu Thr Phe Asn Val Val Phe Asp Thr Glu Val Asn 
Glu Pro Lys Leu Val Val Val Phe Pro Gin He Glu Leu Pro Phe Val Asp Pro Val Thr Asn Leu Ser 
Gly Glu He Asn Glu Leu Glu Gly Asn Leu Ser Met Asn Tip Asp Tyr Ser Asp Pro Asn Ala Glu Phe 
Tyr Val Tyr Lys Glu Leu Glu Glu Gin Gly Glu Tyr Leu Tyr Glu Phe Val Gly Lys Thr Arg Glu Lys 
Ser Tyr Thr He Glu Asn Phe Thr Lys Gin Glu Phe Asp Lys Phe Ser Gly He Ala He Asn Val Tyr 
Ala Asn Gly Lys Glu Ser Gly Leu Val Val Leu Lys Lys Glu Asn He Lys Leu He Asp Leu Glu Ser 
Val Asp Ser He Ser Ala Thr Tyr Asn Val Asp Thr Asn Glu Leu Lys Leu Asp Trp Asn Tyr Thr Asn 
Ser Ser Val Thr Phe Glu Val Leu Lys Lys Gly He Asn Ser Asn Glu Tyr Glu He He Ser Gin Leu Thr 
Gin Asn Ser Phe Ser Thr Glu Phe Thr Gly Arg Gin Phe Trp Asp Leu Glu Lys He Ala He Arg Val 
Val Ala Asn Gly Phe Glu Ser Lys He Asn Glu He Ser Arg Asp Asp He Thr He Thr Ser Leu Asn Leu 
Pro Leu Thr Ser Ser Thr Met Tyr Thr Leu Phe He Arg Ser Tyr Phe Asp Thr Asp Gly Asp Gly Val 
Gly Asp Phe Ser Gly Val Ala Glu Lys Val Asp Tyr Leu Lys Ser Leu Gly Val Asp Thr Val Trp Phe 
Leu Pro Phe Asn Lys Ser Lys Ser Tyr His Gly Tyr Asp Val Glu Asp Tyr Tyr Asp Val Glu Pro Asp 
Tyr Gly Thr Leu Gin Asp Leu Asp Asn Met He Lys Val Leu Asn Glu Asn Gly lie Lys Val Val Met 
Asp Leu Val Val Asn His Thr Ser Asp Thr His Pro Trp Phe Leu Asp Ala Val Glu Asn Thr Thr Asn 
Ser Pro Tyr Trp Asn Tyr Tyr He Met Ser Leu Asp Glu Pro Gin Asn Lys Asn His Trp His Tyr Lys 
Val Asn Ser Lys Gly Gin Thr Val Trp Tyr Phe Gly Leu Phe Asp Ser Ser Met Pro Asp Leu Asn Tyr 
Asp Asn Pro Lys Val Met Asp Glu Val Lys Lys He He Asp Phe Trp Ala Asp Met Gly Val Asp Gly 
Phe Arg Leu Asp Ala Ala Lys His Tyr Tyr Gly Phe Asp Trp Ser Asp Gly He Glu Gin Ser Ala Ser 
Val Ala Lys Glu He Glu Asp Tyr He Lys Asp Lys Leu Gly Glu Asn Ala He Val Val Ser Glu Val 
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Tyr Asp Gly Asp Ser Asn Val Leu Leu Lys Phe Ala Pro Met Pro Val Phe Asn Phe Ser Phe Met Tyr 
Asn Leu Arg Gly Asn Phe Glu Gly Arg Asp Asn Leu He Ser Asp Ser He Ser Trp Val Asp Ser Ser 
Leu Tyr Asn Leu Asn Val Phe His Phe Pro Phe He Asp Ser His Asp Leu Asp Arg Phe He Ser Glu 
Leu Val Asp Ser Lys Tyr Gin Gly Asp Val He Ser Ala Thr Lys Gin Tyr Leu Leu Val Asn Ala Leu 
Leu Leu Ser Leu Thr Gly Met Pro Thr He Tyr Tyr Gly Asp Glu He Gly Leu Arg Gly Trp Lys Trp 
His Ser Glu Pro Trp Asp He Pro Val Arg Glu Pro Met Ghi Trp Tyr Lys Asp Gin Lys Gly Asn Gly 
Gin Thr Tyr Trp Thr Lys Glu Phe Tyr Glu Gly He Thr Glu Gly Ser Ala Asn Glu Asp Gly Ala He 
Tyr Asp Asp Pro Asp Asp Gly Val Ser Val Glu Glu Ghi Glu Asn Gly Tyr Ser He Leu Asn Phe Phe 
Lys Glu Phe He Asn Leu Arg Lys Asp Tyr Pro Ala Leu Ala Phe Gly Ser Thr Thr He Glu Arg Asp 
Trp Lys Asn Leu Tyr Val Leu Lys Lys Ser Tyr Asn Phe Gin Asp Val Leu Val Leu He Asn Leu Asp 
Pro Thr Tyr Ser Asn Thr Tyr Glu Val Pro Glu Gly Tyr Lys Trp Val Trp Tyr Ala Phe Phe Asp Gly 
Asp Asn Tyr Glu Phe Gly Ala Lys Asp Glu Met He Leu Gin Asn Thr Ser Tip Thr He Asn Pro Arg 
Gin He Tyr He Phe Val Lys 

SEQHJNO: 99 

atgtacacactcttcatccgctctttttacgatacaaacaacgacggtgtaggtgactacaacggtgttgcccaaaaagtagactatctcaaaacg 

cttggagtggatacagtttggttcttgccgttcaacaaagcaaaatcgtaccacggttacgatgttgaagactactacgatgtagaacctgactatg 

gaacatacgcacaacttgaaaatatgataaagacactcaatcagaacggaattcgtgttgttatggacttggttgtgaaccacacttccgatacac 

actcgtggtttctggatgccgttgagaacacaacgaattcgaaatattggagctactacataatgacacttgaaaatagagacggttggaatcact 

ggcattggaagataaactcaaaagggcaaaaagtttactacttcggactgtttgactcatcaatgcccgatttgaatttcgacaatccacaagtgat 

gaacgaaatcaagagaataatcgatttctggataacagttggtgtggatggtttcagacttgatgcaccaaagcactacaaaggctgggattggg 

acgacggcatttcaggttcagcagcaatcgcgagggaaatagaaagttacatcaggagcaagttaggaaacgatgcgatagttgtcggggaa 

gtgtacgatggaaatccatcggttctttcacaatttgcaccgatgccggcgttcaacttcacattcatgtatggaataacaggcaaccatgagggg 

aaagataacctgctgggagaaacaatttcatgggttaatggagcgagttattatctcaacgtaaaacatttcccgttcatagacaatcacgatttga 

acagatggatatcgatacttatcgaccaaaagtatagtggaaacacacaagttggtacgaagcagtatattttaacaaatgcgctcttgctttcctta 

aacggtatgcctgttatttattatgggaatgaaataggcttgagaggatggaaatggggacaagacccgtgggatttgccggtgagagagccga 

tgcagtggtacgcaagtcaaagtggagctgggcagacatggtggacaaagcctgtctaccagcaaaaaggaatcacatttggaaatgcaaac 

gtcgatggtgcgatgtacgatgatccaaatgatggggtttcagtagaagagcagatgaatggttacacgataaataacttctttaaacaattcataa 

ccctgaggaagacatatccggctctatcgaaaggttcgataacgatagaacgcgactggaagaacctgtacgttatcaaacgagtctacggaa 

atcaggaagtgcttgtattgataaacttagacccaacttggccgaacaattacacgttaccaggtggatacaggtgggtctg 

tgggagtttgtttgaatttggcaataaaaacgaatcaccactgagccaagataccaactggacagtcaatccaaggcaagtgtatgtgttt^^ 

ggactaa 

SEQroNO: 100 

Met Tyr Thr Leu Phe He Arg Ser Phe Tyr Asp Thr Asn Asn Asp Gly Val Gly Asp Tyr Asn Gly Val 
Ala Gin Lys Val Asp Tyr Leu Lys Thr Leu Gly Val Asp Thr Val Trp Phe Leu Pro Phe Asn Lys Ala 
Lys Ser Tyr His Gly Tyr Asp Val Glu Asp Tyr Tyr Asp Val Glu Pro Asp Tyr Gly Thr Tyr Ala Gin 
Leu Glu Asn Met He Lys Thr Leu Asn Gin Asn Gly He Arg Val Val Met Asp Leu Val Val Asn His 
Thr Ser Asp Thr His Ser Trp Phe Leu Asp Ala Val Glu Asn Thr Thr Asn Ser Lys Tyr Trp Ser Tyr 
Tyr He Met Thr Leu Glu Asn Arg Asp Gly Tip Asn His Trp His Trp Lys He Asn Ser Lys Gly Gin 
Lys Val Tyr Tyr Phe Gly Leu Phe Asp Ser Ser Met Pro Asp Leu Asn Phe Asp Asn Pro Gin Val 
Met Asn Glu He Lys Arg He He Asp Phe Trp He Thr Val Gly Val Asp Gly Phe Arg Leu Asp Ala 
Pro Lys His Tyr Lys Gly Trp Asp Trp Asp Asp Gly He Ser Gly Ser Ala Ala He Ala Arg Glu He Glu 
Ser Tyr He Arg Ser Lys Leu Gly Asn Asp Ala He Val Val Gly Glu Val Tyr Asp Gly Asn Pro Ser 
Val Leu Ser Gin Phe Ala Pro Met Pro Ala Phe Asn Phe Thr Phe Met Tyr Gly lie Thr Gly Asn His 
Glu Gly Lys Asp Asn Leu Leu Gly Glu Thr He Ser Trp Val Asn Gly Ala Ser Tyr Tyr Leu Asn Val 
Lys His Phe Pro Phe He Asp Asn His Asp Leu Asn Arg Trp He Ser He Leu He Asp Gin Lys Tyr Ser 
Gly Asn Thr Gin Val Gly Thr Lys Gin Tyr He Leu Thr Asn Ala Leu Leu Leu Ser Leu Asn Gly Met 
Pro Val He Tyr Tyr Gly Asn Glu He Gly Leu Arg Gly Trp Lys Trp Gly Gin Asp Pro Trp Asp Leu 
Pro Val Arg Glu Pro Met Gin Trp Tyr Ala Ser Gin Ser Gly Ala Gly Gin Thr Trp Trp Thr Lys Pro 
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Val Tyr Gin Gin Lys Gly He Thr Phe Gly Asn Ala Asn Val Asp Gly Ala Met Tyr Asp Asp Pro Asn 
Asp Gly Val Ser Val Glu Glu Gin Met Asn Gly Tyr Thr He Asn Asn Phe Phe Lys Gin Phe lie Thr 
Leu Arg Lys Thr Tyr Pro Ala Leu Ser Lys Gly Ser He Thr He Glu Arg Asp Trp Lys Asn Leu Tyr 
Val He Lys Arg Val Tyr Gly Asn Gin Glu Val Leu Val Leu He Asn Leu Asp Pro Thr Trp Pro Asn 
Asn Tyr Thr Leu Pro Gly Gly Tyr Arg Tip Val Trp Tyr Ala Phe Phe Asn Gly Ser Leu Phe Glu Phe 
Gly Asn Lys Asn Glu Ser Pro Leu Ser Gin Asp Thr Asn Trp Thr Val Asn Pro Arg Gin Val Tyr Val 
Phe Val Lys Asp 

SEQIDNO: 101 

ttgcgattctttccaaagttaatatccccttttccgcaaaacaccagagagtggcagcgaagcgcagtatcaagagacactgaacaattacaaag 

gaaagtaataatgatcaatttgaaaaaaaacaccattagcgccctggtcgcaggtatggtattaggctttgcatccaacgcaatggcggttcctag 

aaccgcttttgtacacctctttgaatggaaatgggaagatgttgcacaggagtgtgaaacatttctcggacctaaaggctttgccgcagtgcaagt 

ctctccgccaactaaatctcacaacacggatgcatggtggggccgttatcaacccgttagttatgcttttgaaggacgcagcggtaatcgcagcc 

aatttaaaaatatggtgcaacgttgtaaagctgtaggcgtcgatatatacgtagatgcagtgattaaccacatggcagcctacgacagaaatttcc 

ctgatgtaccctatagcagtaatgactttaactcctgtacaggagatattgactataataaccgttggcaaacacagcattgtgatttagtcggtctto 

atgatctaaaaacaggatctgactacgtccgccaaaaaatagcggattatatgaacgacgcaatcagtatgggtgtagctggtttccgtattgatg 

cagccaaacatataccagcaggtgatatagctgccattaaaggtaaattaaatggtaatccatacatcttccaagaggtaattggtgcatccggcg 

aacctgttcgaccgactgaatacacctttatcggtggtgtcacggaatttcaatttgctcgaaaattgggtccagccttccgcaatagtaato^ 

ggttaaaagacattggcagtcaaatggaattatccagtgctgatgccgtaacatttgtaacgaatcatgatgaagagcgtcataacccgaatggtc 

ctatttggcacggcgttcaaggtaatggttatgcattagcaaatattttcaccttagcttacccttacggctatccaaaaatcatgtcaggatac^^ 

ccacggtgactttaacgcagctccaccaagcagtggtatacacacaggaaatgcgtgtggttttgatggcggagactgggtatgcgaacacaa 

atggcgcggtattgctaacatggttgccttccgcaactatacagcaagcgaatggcgtatcagtaattggtggcaaaacagtaacgaccaaattg 

cttttggtcgcggtggtttaggttttgttgttattaataaacgtgctaatggtagcattaatcaaagttttgatacgggaatgcctgatggccaata^ 

aacataatagaagctaactttgatgaaagcaccggccaatgtagtgcagctacagattccaacggtcaagccgttattaccgtcagtggtgggca 

agctaactttaatgtagcaggcgatcatgctgctgcaattcatgttggcgcaaaaattggtgatcaatgtagtggtgatgattgcccatgtacagga 

tccgattgtaataatgatcctaaacctgattttgcagtaccagcaacatcaatttgtacatcagaaaatttacctacgctatattactggggagcaca 

gcctacagatagcttagcgaatgcagcnggccaggtgtcgcaatgcaaacaaatggcgactttaagtgtcatgatttaggt^^ 

aaattaacgccatctttagtgacaatggtgcaaataaaacagctgatctaactgttactggtgcaggttgttataaagacgggacttggagcacctt 

acaaaattgtggctttgaaattaccggtgcacaaaccaatccagtcggtggcgacgaagtctggtacttccgaggtactgctaatgactggggta 

aagcacaattagattatgacgcaactagcggtttgtattacacaatacaaagctttaatggtgaagaagcacctgcgcgtttta^^ 

agttggactgaagcttatccaacagctgattaccaagttacagataacaattcataccgcattaactttaatagcgatagcaaagcgattacag^ 

acgcacaataa 

SEQIDNO: 102 

Met Arg Phe Phe Pro Lys Leu He Ser Pro Phe Pro Gin Asn Thr Arg Glu Trp Gin Arg Ser Ala Val 
Ser Arg Asp Thr Glu Gin Leu Gin Arg Lys Val He Met He Asn Leu Lys Lys Asn Thr He Ser Ala 
Leu Val Ala Gly Met Val Leu Gly Phe Ala Ser Asn Ala Met Ala Val Pro Arg Thr Ala Phe Val His 
Leu Phe Glu Trp Lys Trp Glu Asp Val Ala Gin Glu Cys Glu Thr Phe JLeu Gly Pro Lys Gly Phe Ala 
Ala Val Gin Val Ser Pro Pro Thr Lys Ser His Asn Thr Asp Ala Trp Trp Gly Arg Tyr Gin Pro Val 
Ser Tyr Ala Phe Glu Gly Arg Ser Gly Asn Arg Ser Gin Phe Lys Asn Met Val Gin Arg Cys Lys Ala 
Val Gly Val Asp He Tyr Val Asp Ala Val lie Asn His Met Ala Ala Tyr Asp Arg Asn Phe Pro Asp 
Val Pro Tyr Ser Ser Asn Asp Phe Asn Ser Cys Thr Gly Asp He Asp Tyr Asn Asn Arg Trp Gin Thr 
Gin His Cys Asp Leu Val Gly Leu Asn Asp Leu Lys Thr Gly Ser Asp Tyr Val Arg Gin Lys He Ala 
Asp Tyr Met Asn Asp Ala He Ser Met Gly Val Ala Gly Phe Arg He Asp Ala Ala Lys His He Pro 
Ala Gly Asp He Ala Ala He Lys Gly Lys Leu Asn Gly Asn Pro Tyr He Phe Gin Glu Val He Gly Ala 
Ser Gly Glu Pro Val Arg Pro Thr Glu Tyr Thr Phe He Gly Gly Val Thr Glu Phe Gin Phe Ala Arg 
Lys Leu Gly Pro Ala Phe Arg Asn Ser Asn He Ala Trp Leu Lys Asp He Gly Ser Gin Met Glu Leu 
Ser Ser Ala Asp Ala Val Thr Phe Val Thr Asn His Asp Glu Glu Arg His Asn Pro Asn Gly Pro He 
Trp His Gly Val Gin Gly Asn Gly Tyr Ala Leu Ala Asn He Phe Thr Leu Ala Tyr Pro Tyr Gly Tyr 
Pro Lys He Met Ser Gly Tyr Phe Phe His Gly Asp Phe Asn Ala Ala Pro Pro Ser Ser Gly He His Thr 
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Gly Asn Ala Cys Gly Phe Asp Gly Gly Asp Trp Val Cys Glu His Lys Tip Arg Gly He Ala Asn Met 
Val Ala Phe Arg Asn Tyr Thr Ala Ser Glu Trp Arg He Ser Asn Trp Tip Gin Asn Ser Asn Asp Gin 
lie Ala Phe Gly Arg Gly Gly Leu Gly Phe Val Val He Asn Lys Arg Ala Asn Gly Ser He Asn Gin 
Ser Phe Asp Thr Gly Met Pro Asp Gly Gin Tyr Cys Asn He He Glu Ala Asn Phe Asp Glu Ser Thr 
Gly Gin Cys Ser Ala Ala Thr Asp Ser Asn Gly Gin Ala Val He Thr Val Ser Gly Gly Gin Ala Asn 
Phe Asn Val Ala Gly Asp His Ala Ala Ala He His Val Gly Ala Lys He Gly Asp Gin Cys Ser Gly 
Asp Asp Cys Pro Cys Thr Gly Ser Asp Cys Asn Asn Asp Pro Lys Pro Asp Phe Ala Val Pro Ala 
Thr Ser He Cys Thr Ser Glu Asn Leu Pro Thr Leu Tyr Tyr Trp Gly Ala Gin Pro Thr Asp Ser Leu 
Ala Asn Ala Ala Trp Pro Gly Val Ala Met Gin Thr Asn Gly Asp Phe Lys Cys His Asp Leu Gly Val 
Glu Leu Thr Lys He Asn Ala He Phe Ser Asp Asn Gly Ala Asn Lys Thr Ala Asp Leu Thr Val Thr 
Gly Ala Gly Cys Tyr Lys Asp Gly Thr Trp Ser Thr Leu Gin Asn Cys Gly Phe Glu He Thr Gly Ala 
Gin Thr Asn Pro Val Gly Gly Asp Glu Val Trp Tyr Phe Arg Gly Thr Ala Asn Asp Trp Gly Lys Ala 
Gin Leu Asp Tyr Asp Ala Thr Ser Gly Leu Tyr Tyr Thr He Gin Ser Phe Asn Gly Glu Glu Ala Pro 
Ala Arg Phe Lys He Asp Asn Gly Ser Trp Thr Glu Ala Tyr Pro Thr Ala Asp Tyr Gin Val Thr Asp 
Asn Asn Ser Tyr Arg He Asn Phe Asn Ser Asp Ser Lys Ala He Thr Val Asn Ala Gin 

SEQ ID NO: 103 

gtgctaacgtttcaccgcatcattcgaaaaggatggatgttcctgctcgcgtttttgctcactgcctcgctgttctgcccaacaggacagcccgcca 

aggctgccgcaccgtttaacggcaccatgatgcagtattttgaatggtacttgccggatgatggcacgttatggaccaaagtggccaatgaag 

caacaacttatccagccttggcatcaccgctctttggctgccgcccgcttacaaaggaacaagccgcagcgacgtagggtacggagtatacga 

cttgtatgacctcggcgaattcaatcaaaaagggaccgtccgcacaaaatacggaacaaaagctcaatatcttcaagccattcaagccgcccac 

gccgctggaatgcaagtgtacgccgatgtcgtgttcgaccataaaggcggcgccgacggcacggaatgggtggacgccgtcgaagtcaatc 

cgtccgaccgcaaccaagaaatctcgggcacctatcaaatccaagcatggacgaaatttgattttcccgggcggggcaacacctactccagctt 

taagtggcgctggtaccattttgacggcgttgattgggacgaaagccgaaaattgagccgcatttacaaattccgcggcatcggcaaagcgtgg 

gattgggaagtagacacggaaaacggaaactatgactacttaatgtatgccgaccttgatatggatcatcccgaagtcgtgaccgagctgaaaa 

actggggggaatggtatgtcaacacaacgaacattgatgggttccggcttgatgccgtcaagcatattaagttcagtttttttcctgattggtt^^ 

atgtgcgttctcagactggcaagccgctatttaccgtcggggaatattggagctatgacatcaacaagttgcacaattacattacgaaaacaaacg 

gaacgatgtctttgtttgatgccccgttacacaacaaattttataccgcttccaaatcagggggcgcatttgatatgcgcacgttaatgaccaatact 

ctcatgaaagatcaaccgacattggccgtcaccttcgttgataatcatgacaccgaacccggccaagcgctgcagtcatgggtcgacccatggt 

tcaaaccgttggcttacgcctttattctaactcggcaggaaggatacccgtgcgtcttttatggtgactattatggcattccacaatate 

gctgaaaagcaaaatcgatccgctcctcatcgcgcgcagggattatgcttacggaacgcaacatgattatcttgatcactccgacatcatcgggt 

ggacaagggaaggggtcactgaaaaaccaggatccgggctggccgcactgatcaccgatgggccgggaggaagcaaatggatgtacgttg 

gcaaacaacacgctggaaaagtgttctatgaccttaccggcaaccggagtgacaccgtcaccatcaacagtgatggatggggggaattcaaa 

gtcaatggcggttcggtttcggtttgggttcctagaaaaacgaccgtttctaccatcgctcggccgatcacaacccgaccgtggactggtg^^ 

gtccgttggaccgaaccacggttggtggcatggccttga 

SEQroNO: 104 

Val Leu Thr Phe His Arg He He Arg Lys Gly Trp Met Phe Leu Leu Ala Phe Leu Leu Thr Ala Ser 
Leu Phe Cys Pro Thr Gly Gin Pro Ala Lys Ala Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe 
Glu Trp Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn Leu Ser Ser 
Leu Gly He Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly 
Val Tyr Asp Leu Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr 
Lys Ala Gin Tyr Leu Gin Ala He Gin Ala Ala His Ala Ala Gly Met Gin Val Tyr Ala Asp Val Val 
Phe Asp His Lys Gly Gly Ala Asp Gly Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg 
Asn Gin Glu He Ser Gly Thr Tyr Gin He Gin Ala Tip Thr Lys Phe Asp Phe Pro Gly Arg Gly Asn 
Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu 
Ser Arg lie Tyr Lys Phe Arg Gly He Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu Asn Gly Asn 
Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His Pro Glu Val Val Thr Glu Leu Lys Asn 
Tip Gly Glu Trp Tyr Val Asn Thr Thr Asn He Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys 
Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gin Thr Gly Lys Pro Leu Phe Thr Val Gly 
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Glu Tyr Trp Ser Tyr Asp He Asn Lys Leu His Asn Tyr He Thr Lys Thr Asn Gly Thr Met Ser Leu 
Phe Asp Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala Phe Asp Met Arg Thr 
Leu Met Thr Asn Thr Leu Met Lys Asp Gin Pro Thr Leu Ala Val Thr Phe Val Asp Asn His Asp 
Thr Glu Pro Gly Gin Ala Leu Gin Ser Tip Val Asp Pro Tip Phe Lys Pro Leu Ala Tyr Ala Phe He 
Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp Tyr Tyr Gly He Pro Gin Tyr Asn He 
Pro Ser Leu Lys Ser Lys He Asp Pro Leu Leu He Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin His 
Asp Tyr Leu Asp His Ser Asp He He Gly Trp Thr Arg Glu Gly Val Thr Glu Lys Pro Gly Ser Gly 
Leu Ala Ala Leu He Thr Asp Gly Pro Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gin His Ala Gly 
Lys Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr He Asn Ser Asp Gly Trp Gly Glu 
Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp Val Pro Arg Lys Thr Thr Val Ser Thr He Ala Arg 
Pro He Thr Thr Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val Ala Trp Pro 



SEQIDNO: 105 

atgtccctattcaaaaaaatctttccgtggattgtatctctacttcttttgttttcgtttattgctcctttttccattcaaacagaaaaagtccgcgctggaa 

gtgttccagtgaatggaacgatgatgcaatatttcgaatggtaccttccagacgatggaacactatggacgaaagtagcaaataacgcccaatct 

ttagcgaatcttggcattactgccctttggcttccccctgcctataaaggaacaagcagcagtgacgttggatatggcgtttatgatttatatgacct 

aggagagtttaatcaaaaaggaactgtccgaacaaaatacggaacaaaaacacaatatatccaagcaatccaagcggcgcatacagcaggaa 

tgcaagtatatgcagatgtcgtctttaaccataaagccggtgcagatgggacagaactagtggatgcagtagaagtaaacccttctgaccgcaat 

caagaaatatcaggaacatatcaaatccaagcgtggacaaaatttgattttcctggtcgtggaaacacctattctagttttaaatggcgttggtatca 

tttcgatggaacggactgggatgagagtagaaaactaaatcgtatttacaaattccgcggcacgggaaaagcatgggattgggaagtagataca 

gaaaatgggaattatgactatctcatgtatgcagatttggatatggatcatccagaggttgtatctgaactaaaaaattggggaaagtggtatgtaa 

ccacaaccaatatcgacggattccgtctggatgcagtgaagcatattaaatatagctttttcccagactggctatcgtatgtacgaacccaaacac 

aaaagcctctttttgccgttggcgaattttggagctatgacattaacaagctacacaactatattacaaagacgaacggctctatgtccctattcgat 

gccccgctgcataacaatttttatatagcatcgaaatcaggtggctattttgatatgcgcacattactcaacaacacattgatgaaagatcaaccaa 

cactatcggtcacattagtagacaatcacgatactgagccagggcaatctttgcagtcgtgggtcgagccgtggtttaaaccgttagcttacgcat 

ttatcttgacccgccaagaaggttatccgtgcatcttttatggagattactatggtattccaaaatacaacattcctgcgctgaaaagcaaacttgatc 

cgctgttaattgctcgaagagattatgcctacggaacacagcacgactatattgacaatgcagatattatcggctggacgcgggaaggagtagct 

gaaaaagcaaattcgggacttgctgcactcattaccgacggacctggcggaagcaaatggatgtatgttggcaaacaacacgctggcaaaac 

gttttatgatctaaccggcaatcgaagtgatacagtgacaatcaacgctgatggatggggagaatttaaagtcaatggagggtctgtatccatatg 

ggttccaaaaacatcaaccacttcccaaatcacatttactgtaaataatgccacaaccgtttggggacaaaatgtatacgttgtcgg^^ 

cagctgggcaac 

SEQHDNO: 106 

Met Ser Leu Phe Lys Lys He Phe Pro Trp He Val Ser Leu Leu Leu Leu Phe Ser Phe He Ala Pro Phe 
Ser He Gin Thr Glu Lys Val Arg Ala Gly Ser Val Pro Val Asn Gly Thr Met Met Gin Tyr Phe Glu 
Trp Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Asn Ala Gin Ser Leu Ala Asn Leu 
Gly He Thr Ala Leu Tip Leu Pro Pro Ala Tyr Lys Gly Thr Ser Ser Ser Asp Val Gly Tyr Gly Val 
Tyr Asp Leu Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys 
Thr Gin Tyr He Gin Ala He Gin Ala Ala His Thr Ala Gly Met Gin Val Tyr Ala Asp Val Val Phe 
Asn His Lys Ala Gly Ala Asp Gly Thr Glu Leu Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn 
Gin Glu He Set Gly Thr Tyr Gin He Gta Ala Trp Thr Lys Phe Asp Phe Pro Gly Arg Gly Asn Thr 
Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn 
Arg He Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu Asn Gly Asn Tyr 
Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His Pro Glu Val Val Ser Glu Leu Lys Asn Trp 
Gly Lys Trp Tyr Val Thr Thr Thr Asn He Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys Tyr 
Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Thr Gin Thr Gin Lys Pro Leu Phe Ala Val Gly Glu 
Phe Trp Ser Tyr Asp He Asn Lys Leu His Asn Tyr He Thr Lys Thr Asn Gly Ser Met Ser Leu Phe 
Asp Ala Pro Leu His Asn Asn Phe Tyr He Ala Ser Lys Ser Gly Gly Tyr Phe Asp Met Arg Thr Leu 
Leu Asn Asn Thr Leu Met Lys Asp Gin Pro Thr Leu Ser Val Thr Leu Val Asp Asn His Asp Thr 
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Glu Pro Gly Gin Ser Leu Gin Ser Trp Val Glu Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu 
Thr Arg Gin Glu Gly Tyr Pro Cys lie Phe Tyr Gly Asp Tyr Tyr Gly He Pro Lys Tyr Asn He Pro 
Ala Leu Lys Ser Lys Leu Asp Pro Leu Leu lie Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin His Asp 
Tyr He Asp Asn Ala Asp He He Gly Trp Thr Arg Glu Gly Val Ala Glu Lys Ala Asn Ser Gly Leu 
Ala Ala Leu lie Thr Asp Gly Pro Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gin His Ala Gly Lys 
Thr Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr He Asn Ala Asp Gly Trp Gly Glu Phe 
Lys Val Asn Gly Gly Ser Val Ser He Trp Val Pro Lys Thr Ser Thr Thr Ser Gin He Thr Phe Thr Val 
Asn Asn Ala Thr Thr Val Trp Gly Gin Asn Val Tyr Val Val Gly Asn He Ser Gin Leu Gly Asn 



SEQIDNO:107 

atggacagcctcgacgcgccggagcagaagccctgggtgaaggatggcaggctctccgcgtacctggatacagggacagggaccgtggtc 

gctcccgaggcacctgcgcccccgccgcccxcggccgaggaagtccggcccgtggacaagtggaaaaacgatatcatctamcgtc^^^ 

cgaccgtttccaggatggcgacaagaccaacaacatggacgtggtcccgacggacatgaaaaaatatcatggcggcgacatccaggggctc 

atcgacaagctcgactatatcaaggagaccggttcgacggccatctggctcacgccccctatgaaggggcagacccacttcttcgagaccgac 

aattaccatggttactggcccattgacttctatgacacggacccccatgtgggcaccatgcagaaatttgaggagcttatcgagaaagcccatga 

gaaagggctgaagatcgtgctcgatattcccctgaaccacacggcctgggagcatcccttctacaaggacgacagcaagaaggactggttcc 

accatataggagatgtgaaggactgggaagatccctactgggctgaaaacggctccatattcggtcttcctgacctggcgcaggaaaaccctg 

ccgtggaaaagtacctcatcgacgtggccaagttctgggtagacaagggtattgacggcttcaggcttgacgccgtgaagaacgtgcccctca 

acttctgggcgaagtttgaccgggcgattcacgattatgcgggcaaggacttcctcctcgtcggggaatactttgacggaaacccggcgaaagt 

cgcgaactaccagagagaggacatgagctcactcttcgattacccgctctactggaccctgaaggacaccttcgccaaggacgggagcatgc 

gcaacctggcggcgaagcttgatgagtgcgacaggaattatcccgacccgggcctcatgtcggttttccttgataaccacgacacgccgaggtt 

cctcaccgaggccaacggcaacaaggataagctcaaactggccctcgccttcgcgatgaccatcaaccgcatgcctaccatttattatggcacc 

gaggttgccatggaaggcaactgcgatatcatgggcgccgtagataaccggagggacatgcagtgggacaaggatcctgacatgttcaaata 

cttcaagactctcaccactgcccgcaatgagcatgaatccctcagggaaggaaagaagctcgagatgtggcaggatgacaaagtctacgcgta 

cgggaggcagaccccgaaggacgagtctatcgtggtgcttaacaacggctatgatacgcaggaacgggacataccgctccgccccgagag 

cggcatcaagaacggcacggtgctgaaggatgtcatcaccggcgaaaccgtgacggtacagaacggaaaaatccatgcgaaatgcggcgg 

caaacaggcgcggatctacgtgcccgcgtag 

SEQIDNO:108 

Met Asp Ser Leu Asp Ala Pro Glu Gin Lys Pro Trp Val Lys Asp Gly Arg Leu Ser Ala Tyr Leu Asp 
Thr Gly Thr Gly Thr Val Val Ala Pro Glu Ala Pro Ala Pro Pro Pro Pro Pro Ala Glu Glu Val Arg 
Pro Val Asp Lys Trp Lys Asn Asp He He Tyr Phe Val Leu Thr Asp Arg Phe Gin Asp Gly Asp Lys 
Thr Asn Asn Met Asp Val Val Pro Thr Asp Met Lys Lys Tyr His Gly Gly Asp He Gin Gly Leu He 
Asp Lys Leu Asp Tyr He Lys Glu Thr Gly Ser Thr Ala He Trp Leu Thr Pro Pro Met Lys Gly Gin 
Thr His Phe Phe Glu Thr Asp Asn Tyr His Gly Tyr Trp Pro He Asp Phe Tyr Asp Thr Asp Pro His 
Val Gly Thr Met Gin Lys Phe Glu Glu Leu He Glu Lys Ala His Glu Lys Gly Leu Lys He Val Leu 
Asp He Pro Leu Asn His Thr Ala Trp Glu His Pro Phe Tyr Lys Asp Asp Ser Lys Lys Asp Trp Phe 
His His He Gly Asp Val Lys Asp Trp Glu Asp Pro Tyr Trp Ala Glu Asn Gly Ser He Phe Gly Leu 
Pro Asp Leu Ala Gin Glu Asn Pro Ala Val Glu Lys Tyr Leu He Asp Val Ala Lys Phe Trp Val Asp 
Lys Gly He Asp Gly Phe Arg Leu Asp Ala Val Lys Asn Val Pro Leu Asn Phe Trp Ala Lys Phe Asp 
Arg Ala He His Asp Tyr Ala Gly Lys Asp Phe Leu Leu Val Gly Glu Tyr Phe Asp Gly Asn Pro Ala 
Lys Val Ala Asn Tyr Gin Arg Glu Asp Met Ser Ser Leu Phe Asp Tyr Pro Leu Tyr Trp Thr Leu Lys 
Asp Thr Phe Ala Lys Asp Gly Ser Met Arg Asn Leu Ala Ala Lys Leu Asp Glu Cys Asp Arg Asn 
Tyr Pro Asp Pro Gly Leu Met Ser Val Phe Leu Asp Asn His Asp Thr Pro Arg Phe Leu Thr Glu Ala 
Asn Gly Asn Lys Asp Lys Leu Lys Leu Ala Leu Ala Phe Ala Met Thr He Asn Arg Met Pro Thr He 
Tyr Tyr Gly Thr Glu Val Ala Met Glu Gly Asn Cys Asp He Met Gly Ala Val Asp Asn Arg Arg 
Asp Met Gin Trp Asp Lys Asp Pro Asp Met Phe Lys Tyr Phe Lys Thr Leu Thr Thr Ala Arg Asn 
Glu His Glu Ser Leu Arg Glu Gly Lys Lys Leu Glu Met Trp Gin Asp Asp Lys Val Tyr Ala Tyr Gly 
Arg Gin Thr Pro Lys Asp Glu Ser He Val Val Leu Asn Asn Gly Tyr Asp Thr Gin Glu Arg Asp lie 
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Pro Leu Arg Pro Glu Ser Gly He Lys Asn Gly Thr Val Leu Lys Asp Val He Thr Gly Glu Thr Val 
Thr Val Gin Asn Gly Lys He His Ala Lys Cys Gly Gly Lys Gin Ala Arg He Tyr Val Pro Ala 

SEQ ID NO: 109 

atggcaagaaaaacgctggccatattmcgtacUctagtgcncttagtctctcggcagttccggcaaaggcagaaactctagagaatgg^ 

gttataatgcaggctttctattgggatgttcctggaggaggaatctggtgggacacaatagctcaaaagatacccgaatgggcaagtgcaggaat 

ctcagcgatatggaUccaccagcgagtaagggcatgagcggtggnanccatgggctacgatccctacgatttctttgacctcggcgagt^ 

tcagaaggggacagttgagacgcgcttcggctcaaaggaagaactggtgaacatgataaacaccgcacactcctacggcataaaggtgatag 

cggacatagtcataaaccaccgcgccggtggagaccttgagtggaaccccttcgtgaacgactatacctggacagacttctcaaaagtcgcctc 

cggtaaatatacggccaactaccngacttccacccaaacgagcncactgUgtgatgaaggtacctUggaggataccctgatato^ 

caaaagctgggaccagtactggctctgggcgagcagcgaaagctacgctgcctacctcaggagcataggggttgacgcctggcgtttcgact 

acgtcaagggctacggagcatgggttgttaacgactggctcagctggtggggaggctgggccgttggagagtactgggacacgaacgttga^ 

gcactcctcaactgggcatacagcagcggcgccaaggtctttgacttcccgctctactacaagatggacgaagccttcgacaacaccaacatcc 

cggcattagtggatgcactcagatacggccagacagtggtcagccgcgatcccttcaaggcggtaacmcgttgccaaccacgatacagatat 

aatctggaacaagtatccggcnatgcaUcatccttacctatgagggacagcctgttatattctaccgcgactacgaggagtggctcaacaagga 

taagcttaacaacctcatctggatacacgatcaccUgctggagggagtactgacattgtttactacgacagcgacgagcttatctttgtgagaaac 

ggctatggcaccaaaccaggactgataacctatatcaacctcggctcaagcaaagttggaaggtgggtctacgttccaaagttcgccggttcat 

gcatccacgagtacaccggcaacctcggcggttggatagacaagtacgtctcctccagcggctgggtctatcttgaggccccagcccacgac 

ccggcgaacggctactacggctactctgtctggagctactgcggtgtgggttga 

SEQ ID NO: 110 

Met Ala Arg Lys Thr Leu Ala He Phe Phe Val Leu Leu Val Leu Leu Ser Leu Ser Ala Val Pro Ala 
Lys Ala Glu Thr Leu Glu Asn Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Gly Gly Gly 
He Trp Trp Asp Thr He Ala Gin Lys He Pro Glu Trp Ala Ser Ala Gly He Ser Ala He Trp He Pro Pro 
Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp Leu Gly Glu 
Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Glu Glu Leu Val Asn Met He Asn Thr 
Ala His Ser Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp Leu Glu Trp 
Asn Pro Phe Val Asn Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala Asn 
Tyr Leu Asp Phe His Pro Asn Glu Leu His Cys Cys Asp Glu Gly Thr Phe Gly Gly Tyr Pro Asp He 
Cys His Asp Lys Ser Trp Asp Gin Tyr Trp Leu Trp Ala Ser Ser Glu Ser Tyr Ala Ala Tyr Leu Arg 
Ser He Gly Val Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val Asn Asp Trp 
Leu Ser Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Tip Asp Thr Asn Val Asp Ala Leu Leu Asn Tip 
Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu Ala Phe Asp Asn 
Thr Asn He Pro Ala Leu Val Asp Ala Leu Arg Tyr Gly Gin Thr Val Val Ser Arg Asp Pro Phe Lys 
Ala Val Thr Phe Val Ala Asn His Asp Thr Asp lie He Trp Asn Lys Tyr Pro Ala Tyr Ala Phe lie 
Leu Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys Asp Lys Leu 
Asn Asn Leu He Trp He His Asp His Leu Ala Gly Gly Ser Thr Asp He Val Tyr Tyr Asp Ser Asp 
Glu Leu He Phe Val Arg Asn Gly Tyr Gly Thr Lys Pro Gly Leu He Thr Tyr He Asn Leu Gly Ser 
Ser Lys Val Gly Arg Trp Val Tyr Val Pro Lys Phe Ala Gly Ser Cys He His Glu Tyr Thr Gly Asn 
Leu Gly Gly Trp He Asp Lys Tyr Val Ser Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro Ala His Asp 
Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 

SEQ ID NO: 111 

atgcccgcgttcaaatctaaggtgatgcacatgaagttgaagtaccttgctttagttttgttggctgtggcttcgataggcctcctctcgactccagt 

gggtgctgccaagtactccgaactcgaagagggcggtgttataatgcaggccttctactgggacgtccctaccggtgggatctggtgggacac 

cataagacagaaaatcccggagtggtacgacgctggaatctcggcgatatggattcctccagctagcaaaggtatgggtggtgcatactccatg 

ggttatgacccctacgatttctttgacctcggcgagtactatcagaagggaacagttgagacgcgcttcggctcaaaggaggaactggtgaaca 

tgataaacaccgcacactcctatggcataaaggtgatagcggacatagtcataaaccaccgcgccggcggcgacctggagtggaacccctttg 

taaacaactatacttggacagacttctccaaggtcgcctccggtaaatacacggccaactaccttgacttccacccaaacgaggtcaagtgctgc 

gatgagggtacatttggtgactttccggacatcgcccacgagaagagctgggatcagtactggctctgggcaagcaatgagagctacgccgcc 
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tatctccggagcatagggatcgatgcatggcgtttcgactacgtcaaaggttacggagcgtgggttg^taacgactggctca 

gttgggccgttggagagtactgggacaccaacgttgatgcactccttaactgggcatacaacagcggtgccaaggtctttgacttcccgctctac 

tacaagatggacgaagcctttgacaacaccaacatccccgctttggtttacgccctccagaacggaggaacagtcgtttcccgcgatcccttcaa 

ggcagtaacmcgngccaaccacgataccgatataatctggaacaagtatccggcttatgcgttcatccttacc 

tctaccgcgactacgaggagtggctcaacaaggataagcttaacaaccttatctggatacacgagcaccttgccggaggaagtaccaagatcct 

ctactacgataacgatgagctaatattcatgagggagggctacgggagcaagccgggcctcataacctacataaacctcggaaacgactggg 

ccgagcgctgggtgaacgtcggctcaaagmgccggctacacaatccatgaatacacaggcaatctcggtggctggg^^ 

agtacgacggatgggttaaactgacggcacctcctcacgatccagccaacggatattacggctactcagtctggagc^ 

ga 

SEQIDN0:112 

Met Pro Ala Phe Lys Ser Lys Val Met His Met Lys Leu Lys Tyr Leu Ala Leu Val Leu Leu Ala Val 
Ala Ser He Gly Leu Leu Set Thr Pro Val Gly Ala Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He 
Met Gin Ala Phe Tyr Trp Asp Val Pro Thr Gly Gly He Trp Tip Asp Thr He Arg Gin Lys He Pro 
Glu Trp Tyr Asp Ala Gly He Ser Ala He Tip He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser 
Met Gly Tyr Asp Pro Tyr Asp Phe Phe Asp Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr 
Arg Phe Gly Ser Lys Glu Glu Leu Val Asn Met He Asn Thr Ala His Ser Tyr Gly He Lys Val He 
Ala Asp He Val lie Asn His Arg Ala Gly Gly Asp Leu Glu Trp Asn Pro Phe Val Asn Asn Tyr Thr 
Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu 
Val Lys Cys Cys Asp Glu Gly Thr Phe Gly Asp Phe Pro Asp He Ala His Glu Lys Ser Tip Asp Gin 
Tyr Trp Leu Trp Ala Ser Asn Glu Ser Tyr Ala Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg 
Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val Asn Asp Trp Leu Ser Trp Trp Gly Gly Trp Ala 
Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Leu Leu Asn Trp Ala Tyr Asn Ser Gly Ala Lys Val 
Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu Ala Phe Asp Asn Thr Asn He Pro Ala Leu Val Tyr 
Ala Leu Gin Asn Gly Gly Thr Val Val Ser Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His 
Asp Thr Asp He He Trp Asn Lys Tyr Pro Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Val He 
Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys Asp Lys Leu Asn Asn Leu He Trp He His Glu His 
Leu Ala Gly Gly Ser Thr Lys He Leu Tyr Tyr Asp Asn Asp Glu Leu He Phe Met Arg Glu Gly Tyr 
Gly Ser Lys Pro Gly Leu He Thr Tyr He Asn Leu Gly Asn Asp Trp Ala Glu Arg Trp Val Asn Val 
Gly Ser Lys Phe Ala Gly Tyr Thr lie His Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Arg Trp 
Val Gin Tyr Asp Gly Trp Val Lys Leu Thr Ala Pro Pro His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr 
Ser Val Tip Ser Tyr Ala Gly Val Gly 

SEQroNO: 113 

atgaaacaacaaaaacggctttacgcccgattgctgacgctgttatttgcgctcatcttcttgctgcctcattctgcagcagcggcggcaaatctta 

atgggacgctgatgcagtattttgaatggtacatgcccaatgacggccaacattggaagcgcttgcaaaacgactcggcatatttggctgaac 

ggtattactgccgtctggattcccccggcatataagggaacgagccaagcggatgtgggctacggtgcttacgacctttatgatttaggggagttt 

catcaaaaagggacggttcggacaaagtacggcacaaaaggagagctgcaatctgcgatcaaaagtcttcattcccgcgacattaacgtttacg 

gggatgtggtcatcaaccacaaaggcggcgctgatgcgaccgaagatgtaaccgcggttgaagtcgatcccgctgaccgcaaccgcgtaatt 

tcaggagaacaccgaattaaagcctggacacattttcattttccggggcgcggcagcacatacagcgattttaaatggcattggtaccattttgac 

ggaaccgattgggacgagtcccgaaagctgaaccgcatctataagtttcaaggaaaggcttgggattgggaagtttccaatgaaaacggcaac 

tatgattatttgatgtatgccgacatcgattatgaccatcctgatgtcgcagcagaaattaagagatggggcacttggtatgccaatgaactgcaatt 

ggacggtttccgtcttgatgctgtcaaacacattaaattttcttttttgcgggattgggttaatcatgtcagggaaaaaacggggaaggaaatgttta 

cggtagctgaatattggcagaatgacttgggcgcgctggaaaactatttgaacaaaacaaattttaatcattcagtgtttgacgtgccgcttcattat 

cagttccatgctgcatcgacacagggaggcggctatgatatgaggaaattgctgaacggtacggtcgtttccaagcatccgttgaaagcggtta 

catttgtcgataaccatgatacacagccggggcaatcgcttgagtcgactgtccaaacatggtttaagccgcttgcttacgctttcattctcacaag 

ggaatctggataccctcaggttttctacggggatatgtacgggacgaaaggagactcccagcgcgaaattcctgccttgaaacacaaaattgaa 

ccgatcttaaaagcgagaaaacagtatgcgtacggagcacagcatgattatttcgaccaccatgacattgtcggctggacaagggaaggcgac 

agctcggttgcaaattcaggtttggcggcattaataacagacggacccggtggggcaaagcgaatgtatgtcggccggcaaaacgccggtga 
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gacatggcatgacattaccggaaaccgttcggagccggUgtcatcaaUcggaaggctggggagagtttcacgtaaacggcgggtc^ 
atttatgttcaaagatag 

SEQIDN0:114 

Met Lys Gin Gin Lys Arg Leu Tyr Ala Arg Leu Leu Thr Leu Leu Phe Ala Leu lie Phe Leu Leu Pro 
His Ser Ala Ala Ala Ala Ala Asn Leu Asn Gly Thr Leu Met Gin Tyr Phe Glu Tip Tyr Met Pro Asn 
Asp Gly Gin His Tip Lys Arg Leu Gin Asn Asp Ser Ala Tyr Leu Ala Glu His Gly He Thr Ala Val 
Trp He Pro Pro Ala Tyr Lys Gly Thr Ser Gin Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp 
Leu Gly Glu Phe His Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Gly Glu Leu Gin Ser Ala 
He Lys Ser Leu His Ser Arg Asp He Asn Val Tyr Gly Asp Val Val He Asn His Lys Gly Gly Ala 
Asp Ala Thr Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val lie Ser Gly Glu 
His Arg He Lys Ala Jvp Thr His Phe His Phe Pro Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp 
His Trp Tyr His Phe Asp Gly Thr Asp Tip Asp Glu Ser Arg Lys Leu Asn Arg He Tyr Lys Phe Gin 
Gly Lys Ala Trp Asp Tip Glu Val Ser Asn Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp lie 
Asp Tyr Asp His Pro Asp Val Ala Ala Glu He Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gin 
Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys Phe Ser Phe Leu Arg Asp Tip Val Asn His 
Val Arg Glu Lys Thr Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Trp Ghi Asn Asp Leu Gly Ala 
Leu Glu Asn Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu His Tyr Gin Phe 
His Ala Ala Ser Thr Gin Gly Gly Gly Tyr Asp Met Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys 
His Pro Leu Lys Ala Val Thr Phe Val Asp Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu Ser Thr 
Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Thr Arg Glu Ser Gly Tyr Pro Ghi Val 
Phe Tyr Gly Asp Met Tyr Gly Thr Lys Gly Asp Ser Gin Arg Glu He Pro Ala Leu Lys His Lys He 
Glu Pro He Leu Lys Ala Arg Lys Gin Tyr Ala Tyr Gly Ala Gin His Asp Tyr Phe Asp His His Asp 
He Val Gly Trp Thr Arg Glu Gly Asp Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu He Thr Asp 
Gly Pro Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gin Asn Ala Gly Glu Thr Tip His Asp He Thr 
Gly Asn Arg Ser Glu Pro Val Val He Asn Ser Glu Gly Tip Gly Glu Phe His Val Asn Gly Gly Ser 
Val Ser He Tyr Val Gin Arg 

SEQ ID NO: 115 

atggcgaagtactccgagctggagcagggcggagtcataatgcaggccttctactgggacgttccggagggaggaatctggtgggacacaat 

acggcagaagatccctgaatggtacgatgcaggcatatccgccatctggatacccccggcgagcaagggcatgggcggggcctactcgatg 

ggctacgacccctacgattacttcgatctgggcgagttttaccagaagggaaccgttgagacccgcttcggctccaaggaagagctcgtcaaca 

tgatctccacggcccaccagtacggcatcaaggttatagcggacatagtgataaaccaccgcgcaggtggagacctcgaatggaacccatac 

gtcggcgactatacctggacggacttttctaaggtcgcctccgggaaatacaaggcccactacatggacttccatccaaacaactacagcacct 

cagacgagggaaccttcggtggcttcccagacattgatcacctcgtgcccttcaaccagtactggctgtgggcgagcaacgagagctacgccg 

cctacctcaggagcatagggatcgatgcgtggcgctttgactacgttaagggctacggcgcgtgggtcgtcaaggactggctgagtcagtggg 

gcggctgggccgtcggcgagtactgggacaccaacgtcgatgcgctcctcaactgggcctacagcagcggcgccaaggtcttcgacttccc 

gctctactacaagatggacgaggcctttgacaacaagaacattcccgccctcgtttacgccatccagaacggtgaaaccgtcgtcagcagggat 

cccttcaaggccgttaccttcgtggctaaccacgatacgaacataatctggaacaagtaccctgcctatgccttcatcctgacctacgaaggtcag 

cccgtcatcttctaccgcgactacgaggagtggctcaacaaggacaaactcaacaacctcatatggattcacgagcacctggcagggggaag 

caccaagatcctctactacgacgacgatgagctcatcttcatgagggaaggctacggcgacaggcccgggcttataacctacatcaacctcggt 

agcgactgggcggagagatgggtgaacgttggctcaaagttcgcgggctatacaatccacgaatacaccggaaacctcggcggctgggtcg 

acaggtacgtccagtacgacggctgggtcaagcttaccgctccgccacacgatccggcaaacggctattacggctactcggtctggagctacg 

ccggagttggaagatctcatcaccatcaccatcactaa 

SEQ ID NO: 116 

Met Ala Lys Tyr Ser Glu Leu Glu Gin Gly Gly Val He Met Gin Ala Phe Tyr Trp Asp Val Pro Glu 
Gly Gly He Trp Tip Asp Thr He Arg Gin Lys He Pro Glu Tip Tyr Asp Ala Gly He Ser Ala He Tip 
He Pro Pro Ala Ser Lys Gly Met Gly Gly Ala Tyr Ser Met Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp 
Leu Gly Glu Phe Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Glu Glu Leu Val Asn Met 
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He Ser Thr Ala His Gin Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg Ala Gly Gly Asp 
Leu Glu Tip Asn Pro Tyr Val Gly Asp Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr 
Lys Ala His Tyr Met Asp Phe His Pro Asn Asn Tyr Ser Thr Ser Asp Glu Gly Thr Phe Gly Gly Phe 
Pro Asp He Asp His Leu Val Pro Phe Asn Gin Tyr Tip Leu Trp Ala Ser Asn Glu Ser Tyr Ala Ala 
Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val 
Lys Asp Trp Leu Ser Gin Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala Leu 
Leu Asn Trp Ala Tyr Ser Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu Ala 
Phe Asp Asn Lys Asn He Pro Ala Leu Val Tyr Ala He Gbi Asn Gly Glu Thr Val Val Ser Arg Asp 
Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asn He He Trp Asn Lys Tyr Pro Ala Tyr 
Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp Tyr Glu Glu Trp Leu Asn Lys 
Asp Lys Leu Asn Asn Leu He Tip He His Glu His Leu Ala Gly Gly Ser Thr Lys He Leu Tyr Tyr 
Asp Asp Asp Glu Leu He Phe Met Arg Glu Gly Tyr Gly Asp Arg Pro Gly Leu He Thr Tyr He Asn 
Leu Gly Ser Asp Trp Ala Glu Arg Trp Val Asn Val Gly Ser Lys Phe Ala Gly Tyr Thr He His Glu 
Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Arg Tyr Val Gin Tyr Asp Gly Trp Val Lys Leu Thr Ala 
Pro Pro His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Ala Gly Val Gly Arg Ser 
His His His His His His 

SEQIDN0:117 

ttgcgagtgttcctggttgtgccaaagctgagccgcccatttcaggcagagtcacaacaacaagacagggacataacaatgaaacacacagcg 

ggaatgctggcgatcgcaggtatgctgatcgcccccttggcgcatgccgatgtcatactgcacgccttcaactggaaatacagtgaagtcaccg 

ccaaggccgatctcatcaaggctgccggctacaagcaggtgctcatctcaccgcctctgaagtcctcgggcaacgagtggtgggctcgttacc 

agccccaggatctgcgcctggtcgacaccccccttggcaacaagcaggatctggagcagctgatcgccgcgatgcagacccggggcattgc 

cgtctacgcggacgtggtgctcaaccacatggccaacgaaagctggaagcgcagcgacctcaactaccccggcagcgagctgctgcaaag 

ctacgccggcaatccggcctactttgaacgccagaagctctttggcgatctggggcagaacttcctcgccggccaggattttcatccggagggg 

tgcatcaccgactggaacaatccgggccatgtccagtactggcgactgtgcggcggggcgggtgacaaggggctgccggatctggacccca 

acaactgggtggtgaaccagcaacaggcttacctgcaggcgctcaaggggatggggatcaagggttttcgggtcgatgcggtcaagcacatg 

agcgattaccagatcaacgccgtgttcacccccgagatcaaacaggggatgcacgtctttggcgaggtgatcaccacggggggcgccggca 

acagcgactatgagaacttcctcaaaccctacctcgacagcagcggccagggggcctacgacttcccgctcttcgcctccctgcgtggagcgc 

tgggctacggcggcagcatgaacctgctggccgatcccggtgcctatggtcaggcgctgccgggtagccgcgccgtcaccttcgccatcacc 

cacgacatccccaccaacgacggtttccgctaccagatcctcaaccagaccgacgagagactggcctatgcctacctgctcggtcgcgatggc 

ggttcgcctctggtctactccgatcacggtgaaaccagggacaaggacggattgcgctggcaggactactatctgcgcaccgatctcaaaggg 

atgatccgcttccataacacagtgcagggtcaaccgatgcagctcatcggcagtaacgactgcttcgtgctgttcaagcgtggcaagcagggc 

gtggtcggcatcaacaagtgcgactacgagcaggagtactggctcgataccgccagattcgagatgaactggtatcgcaactaccgggatgtg 

ctcgaccagaatgccgtggtcaacgtgcagagccagtgggtaaggctgaccatcccggcccgcggcgccagaatgtggctgcaggagtga 

SEQ ID NO: 118 

Met Arg Val Phe Leu Val Val Pro Lys Leu Ser Arg Pro Phe Gin Ala Glu Ser Gin Gin Gin Asp Arg 
Asp He Thr Met Lys His Thr Ala Gly Met Leu Ala He Ala Gly Met Leu He Ala Pro Leu Ala His 
Ala Asp Val He Leu His Ala Phe Asn Trp Lys Tyr Ser Glu Val Thr Ala Lys Ala Asp Leu He Lys 
Ala Ala Gly Tyr Lys Gin Val Leu He Ser Pro Pro Leu Lys Ser Ser Gly Asn Glu Trp Trp Ala Arg 
Tyr Gin Pro Gin Asp Leu Arg Leu Val Asp Thr Pro Leu Gly Asn Lys Gin Asp Leu Glu Gin Leu He 
Ala Ala Met Gin Thr Arg Gly He Ala Val Tyr Ala Asp Val Val Leu Asn His Met Ala Asn Glu Ser 
Trp Lys Arg Ser Asp Leu Asn Tyr Pro Gly Ser Glu Leu Leu Gin Ser Tyr Ala Gly Asn Pro Ala Tyr 
Phe Glu Arg Gin Lys Leu Phe Gly Asp Leu Gly Gin Asn Phe Leu Ala Gly Gin Asp Phe His Pro 
Glu Gly Cys He Thr Asp Trp Asn Asn Pro Gly His Val Gin Tyr Trp Arg Leu Cys Gly Gly Ala Gly 
Asp Lys Gly Leu Pro Asp Leu Asp Pro Asn Asn Trp Val Val Asn Gin Gin Gin Ala Tyr Leu Gin 
Ala Leu Lys Gly Met Gly He Lys Gly Phe Arg Val Asp Ala Val Lys His Met Ser Asp Tyr Gin He 
Asn Ala Val Phe Thr Pro Glu He Lys Gin Gly Met His Val Phe Gly Glu Val He Thr Thr Gly Gly 
Ala Gly Asn Ser Asp Tyr Glu Asn Phe Leu Lys Pro Tyr Leu Asp Ser Ser Gly Gin Gly Ala Tyr Asp 
Phe Pro Leu Phe Ala Ser Leu Arg Gly Ala Leu Gly Tyr Gly Gly Ser Met Asn Leu Leu Ala Asp Pro 
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Gly Ala Tyr Gly Gin Ala Leu Pro Gly Ser Arg Ala Val Thr Phe Ala He Thr His Asp He Pro Thr 
Asn Asp Gly Phe Arg Tyr Gin He Leu Asn Gin Thr Asp Glu Arg Leu Ala Tyr Ala Tyr Leu Leu Gly 
Arg Asp Gly Gly Ser Pro Leu Val Tyr Ser Asp His Gly Glu Thr Arg Asp Lys Asp Gly Leu Arg Trp 
Gin Asp Tyr Tyr Leu Arg Thr Asp Leu Lys Gly Met He Arg Phe His Asn Thr Val Gin Gly Gin Pro 
Met Gin Leu He Gly Ser Asn Asp Cys Phe Val Leu Phe Lys Arg Gly Lys Gin Gly Val Val Gly He 
Asn Lys Cys Asp Tyr Glu Gin Glu Tyr Trp Leu Asp Thr Ala Arg Phe Glu Met Asn Tip Tyr Arg 
Asn Tyr Arg Asp Val Leu Asp Gin Asn Ala Val Val Asn Val Gin Ser Gin Tip Val Arg Leu Thr He 
Pro Ala Arg Gly Ala Arg Met Trp Leu Gin Glu 

SEQIDNO: 119 

atgcaaacgtttgcancttattttactcaaagaaaggatgggtgtgcatgaattatttgaaaaaagtgtggttgtattacgctatcgtcgctaccttaa 

tcatttcctttcttacacctttttcaacagcacaagctaatactgcacctgttaacggaacaatgatgcaatatttcgaatgggacttacctaatgatgg 

gacgctttggacgaaagtaaaaaatgaagctaccaatctttcttcactaggtatcacagcactatggctccctccagcatataaaggaacgagcc 

aaagcgatgtcggatacggtgtttacgatttatatgaccttggggaatttaatcaaaaagggacgatccgaacgaaatacggaacaaaaacaca 

atatattcaagccattcaaactgcccaagccgcagggatgcaagtatatgcggatgttgtatttaatcataaggcaggggctgacagtacagaatt 

tgtcgatgcagttgaggtaaacccttctaatcgaaatcaagaaacatctggcacatatcaaattcaagcatggacaaaatttgattttcctggtcgtg 

gaaacacatactccagcttcaaatggcgctggtaccattttgatggtacggattgggacgaaagtcgtaaattaaatcgtatttacaaattccgcgg 

tacaggaaaagcgtgggactgggaagtcgatacagaaaacggaaactatgattatttaatgncgctgatttagatatggatcacc 

gacagaattaaaaaactggggaacgtggtacgtcaatactacaaatatcgatggattccgcttagatgccgtaaaacatattaaatacagctttttc 

cctgactggctaacatatgtacgtaatcaaacaggaaaaaatttamgccgttggggaattttggagctatgacgtcaataagctgcataato^ 

tacaaaaacaaatgggtcgatgtcattatttgatgcacccttgcataacaacttttataccgcttccaaatcgagtggatattttgacatgcg^ 

tgaataatacattaatgaaagatcaaccttcactcgctgtaacacttgtcgataaccacgacacgcaaccagggcaatctttacagtcatgggtcg 

aaccttggtttaaacagcttgcttacgcctttattttaacaagacaagaagggtatccttgcgtattttacggtgattattatggaatcccta^ 

atcccggggttaaaaagtaaaatcgacccgcttttaattgctcgtcgtgattacgcttatggaacacaacgtgattacattgatcatcaagacattat 

cggatggacacgagaaggcattgatgcaaaaccgaactctggactggcggctttaattaccgacggtcctggtggaagtaaatggatgtatgtc 

ggtaaaaagcatgccgggaaagtattmtgatttaactggaaatcgaagtgacacagtaacgattaatgcggatggttggggagaatttaaagta 

aacggaggatccgtctcaatttgggtggctaaaacgtcaaacgtcacatttacagtcaataacgccacaacaacaagcggacaaaacgtatatg 

ttgtcggcaacattccagagctaggcaattgtcgcacgggttaa 

SEQIDNO: 120 

Met Gin Thr Phe Ala Phe Leu Phe Tyr Ser Lys Lys Gly Trp Val Cys Met Asn Tyr Leu Lys Lys Val 
Trp Leu Tyr Tyr Ala He Val Ala Thr Leu He He Ser Phe Leu Thr Pro Phe Ser Thr Ala Gin Ala Asn 
Thr Ala Pro Val Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Asp Leu Pro Asn Asp Gly Thr Leu 
Tip Thr Lys Val Lys Asn Glu AJa Thr Asn Leu Ser Ser Leu Gly He Thr Ala Leu Tip Leu Pro Pro 
Ala Tyr Lys Gly Thr Ser Gin Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
Asn Gin Lys Gly Thr He Arg Thr Lys Tyr Gly Thr Lys Thr Gin Tyr He Gin Ala He Gin Thr Ala 
Gin Ala Ala Gly Met Gin Val Tyr Ala Asp Val Val Phe Asn His Lys Ala Gly Ala Asp Ser Thr Glu 
Phe Val Asp Ala Val Glu Val Asn Pro Ser Asn Arg Asn Gin Glu Thr Ser Gly Thr Tyr Gin He Gin 
Ala Trp Thr Lys Phe Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His 
Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg He Tyr Lys Phe Arg Gly Thr Gly Lys 
Ala Trp Asp Tip Glu Val Asp Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Phe Ala Asp Leu Asp 
Met Asp His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Thr Trp Tyr Val Asn Thr Thr Asn He 
Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys Tyr Ser Phe Phe Pro Asp Trp Leu Thr Tyr Val 
Arg Asn Gin Thr Gly Lys Asn Leu Phe Ala Val Gly Glu Phe Trp Ser Tyr Asp Vai Asn Lys Leu His 
Asn Tyr He Thr Lys Thr Asn Gly Ser Met Ser Leu Phe Asp Ala Pro Leu His Asn Asn Phe Tyr Thr 
Ala Ser Lys Ser Ser Gly Tyr Phe Asp Met Arg Tyr Leu Leu Asn Asn Thr Leu Met Lys Asp Gin 
Pro Ser Leu Ala Val Thr Leu Val Asp Asn His Asp Thr Gin Pro Gly Gin Ser Leu Gin Ser Trp Val 
Glu Pro Trp Phe Lys Gin Leu Ala Tyr Ala Phe He Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe 
Tyr Gly Asp Tyr Tyr Gly He Pro Lys Tyr Asn He Pro Gly Leu Lys Ser Lys He Asp Pro Leu Leu He 
Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin Arg Asp Tyr He Asp His Gin Asp He He Gly Tip Thr 
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Arg Glu Gly He Asp Ala Lys Pro Asn Ser Gly Leu Ala Ala Leu He Thr Asp Gly Pro Gly Gly Set 
Lys Tip Met Tyr Val Gly Lys Lys His Ala Gly Lys Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp 
Thr Val Thr He Asn Ala Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser He Tip Val 
Ala Lys Thr Ser Asn Val Thr Phe Thr Val Asn Asn Ala Thr Thr Thr Ser Gly Gin Asn Val Tyr Val 
Val Gly Asn He Pro Glu Leu Gly Asn Cys Arg Thr Gly 

SEQIDNO: 121 

atgctcgccctgtcgctcggcggctgcggcatcgacgcgggcccgacaggccctcgcgtcgtggagccgctgccgcagcgccccacgcttc 

cgcaggagtaccgcgccagcggccacgcggccgccggcgacgtgttcgtgcacctgttcgagtggaagtggccggacatcgcggaggaat 

gcgagaacgtgctggggccggcgggctacgaggcggtgcaggtgtcgccgccgcaggagcacctggtgcagcagggggcgccgtggtg 

gcagcggtaccagccggtgagctactcggtggcgctgagccgcagcggcacgggcgtggagttcagcaacatgatcagccggtgcaaggc 

cgccggcgtggacatctacgtggacgccgtcatcaaccacatgacggccggtgcggggacggggagcaacggcaccgcctacaccaagta 

caactaccccggcctgtacgcgcaggcggactttcacccgcagtgcgcggtgggcgactacaccagcgccgccaacgtgcaggactgcga 

actgctggggctggctgacctgaacaccggcgcggccggcgtgcagcagaagatcgcggactacctggtctcgctggcgcggctgggcgt 

ggcgggttttcgcatcgacgccgccaagcacatccagccggtggaactggacgccatcgtggaccgcgtgaaccagacgctggcggcgga 

ggggcgcccgcttccctactggttcgccgaggtgatcgacaacggcggcgagggggtgcggcgcgagcactactacggcctgggatacgg 

caccggcggcgccgcggacatcacggagttccgctacaagggcgtgggcgacaagttcctgggcagcggcggccagcggctggtggacc 

tgaagaacttctcggcggtgacgtggaacctgatgccgtcggacaaggccgtcgtctttctggagaaccacgatacgcagcgcggcggcggc 

atcggctaccgcgatggcacggcgttccggctggccaacgtgtggatgctggcgcagccgtacggctatccgtcggtgatgtccagctacgc 

ctttgaccgcacctccccctttggccgcgacgccggcccgccctccgaggacggcgcgacgaaggacgtgacgtgcgcgcccacgctgga 

gacggcggtgctgggcacctgggtgtgcgagcaccgcgaccccgtcattcagcggatggtgggctttcgccgcgcgatggcgggcacgga 

cctgaaccgctggtgggacaacggcggcaacgccattgccttttcgcgcggggaccggggcttcgtcgccatcagccgcgagccgaaggtg 

accatggcggccgtgcccagcggactgtcccccggcacctactgcgacgtgctgaccggcggcaaggtgggcaacgcctgcgcgggaac 

cagcgtgacggtcgactctcagggcgtggtgcagctgagcatcgtcgagaactcggctctggtgatccacctcggggccaagctgtaacggc 

gcgctggcggatgtgcggaggg 

SEQIDNO: 122 

Met Leu Ala Leu Ser Leu Gly Gly Cys Gly He Asp Ala Gly Pro Thr Gly Pro Arg Val Val Glu Pro 
Leu Pro Gin Arg Pro Thr Leu Pro Gin Glu Tyr Arg Ala Ser Gly His Ala Ala Ala Gly Asp Val Phe 
Val His Leu Phe Glu Trp Lys Trp Pro Asp He Ala Glu Glu Cys Glu Asn Val Leu Gly Pro Ala Gly 
Tyr Glu Ala Val Gin Val Ser Pro Pro Gin Glu His Leu Val Gin Gin Gly Ala Pro Tip Trp Ghi Arg 
Tyr Gin Pro Val Ser Tyr Ser Val Ala Leu Ser Arg Ser Gly Thr Gly Val Glu Phe Ser Asn Met He 
Ser Arg Cys Lys Ala Ala Gly Val Asp He Tyr Val Asp Ala Val He Asn His Met Thr Ala Gly Ala 
Gly Thr Gly Ser Asn Gly Thr Ala Tyr Thr Lys Tyr Asn Tyr Pro Gly Leu Tyr Ala Gin Ala Asp Phe 
His Pro Gin Cys Ala Val Gly Asp Tyr Thr Ser Ala Ala Asn Val Gin Asp Cys Glu Leu Leu Gly Leu 
Ala Asp Leu Asn Thr Gly Ala Ala Gly Val Gin Gin Lys He Ala Asp Tyr Leu Val Ser Leu Ala Arg 
Leu Gly Val Ala Gly Phe Arg He Asp Ala Ala Lys His He Gin Pro Val Glu Leu Asp Ala He Val 
Asp Arg Val Asn Gin Thr Leu Ala Ala Glu Gly Arg Pro Leu Pro Tyr Trp Phe Ala Glu Val He Asp 
Asn Gly Gly Glu Gly Val Arg Arg Glu His Tyr Tyr Gly Leu Gly Tyr Gly Thr Gly Gly Ala Ala Asp 
He Thr Glu Phe Arg Tyr Lys Gly Val Gly Asp Lys Phe Leu Gly Ser Gly Gly Gin Arg Leu Val Asp 
Leu Lys Asn Phe Ser Ala Val Thr Trp Asn Leu Met Pro Ser Asp Lys Ala Val Val Phe Leu Glu Asn 
His Asp Thr Gin Arg Gly Gly Gly He Gly Tyr Arg Asp Gly Thr Ala Phe Arg Leu Ala Asn Val Trp 
Met Leu Ala Gin Pro Tyr Gly Tyr Pro Ser Val Met Ser Ser Tyr Ala Phe Asp Arg Thr Ser Pro Phe 
Gly Arg Asp Ala Gly Pro Pro Ser Glu Asp Gly Ala Thr Lys Asp Val Thr Cys Ala Pro Thr Leu Glu 
Thr Ala Val Leu Gly Thr Tip Val Cys Glu His Arg Asp Pro Val He Gin Arg Met Val Gly Phe Arg 
Arg Ala Met Ala Gly Thr Asp Leu Asn Arg Tip Trp Asp Asn Gly Gly Asn Ala He Ala Phe Ser Arg 
Gly Asp Arg Gly Phe Val Ala He Ser Arg Glu Pro Lys Val Thr Met Ala Ala Val Pro Ser Gly Leu 
Ser Pro Gly Thr Tyr Cys Asp Val Leu Thr Gly Gly Lys Val Gly Asn Ala Cys Ala Gly Thr Ser Val 
Thr Val Asp Ser Gin Gly Val Val Gin Leu Ser He Val Glu Asn Ser Ala Leu Val He His Leu Gly 
Ala Lys Leu Arg Arg Ala Gly Gly Cys Ala Glu 
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SEQIDNO:123 

atgccccaggccattcgcactttttcacgttggacgttgttcggcttaatcggcgttmctgcttggtctcgtctmctgtccca^ 

aggcccagacaaccccggcccgtaccgttatggttcacctcttcgagtggaaatggaccgacatcgctaaagaatgcgagaamcctc^^ 

cgaaaggctttgccgcaatccaggtatcgccgccccaggagcatgtccaggggtcgcaatggtggacccgctatcagccggtcagctacaag 

atcgagagccgctccggcacccgggccgagttcgccaatatggtctcgcgctgcaaagccgtcggggtcgatatctatgtcgatgccgtgatc 

aaccatatgacgactgtcggctccggcactggtatggctggatcgacctacaccagctacacctatccggggctgtatca^^^ 

accactgcgggcgcaatggcaacgatgatatcagcagctacggcgatcgctgggaagtacaaaactgcgaactgctcaacctagccgacctc 

aacaccggcgctgagtatgtccggggtaaactcgccgcctatatgaacgatctgcgcggcctgggcgtcgccggatttcggatcgatgccgcc 

aagcacatggataccaacgacatcaacaatatcgttggccgcctgcccaacgcgccctacatctaccaggaagtgatcgaccagggcggcga 

gccaattaccgccggcgaatacttccagaatggcgatgtgaccgagttcaagtacagccgcgagatctcgcgcatgttcaaaaccggccagct 

gacccatatgagccagncggcactgcctggggcttcatgtccagcgacctggcagtagttttcaccgataaccacgacaaccagcgcggtca 

cggcggcgccggcgatgtcttgacctacaaagatggccagctgtacaccctgggcaatatcUcgagctagcctggccgtatggct^^ 

ggtcatgtcgagctacacgttcagcaacggcgaccaggggccgccatcgaccaatgtgtacgcaaccacaacgcctgattgtggcaacggcc 

gctgggtctgtgagcaccgctggcgaggaatcgccaacatggtcgcgttccgcaactacaccgccccgaccttcagcaccagcaactggtgg 

agcaacggcaacaaccagatcgctttcagccgcgggaccctgggctttgtggcgatcaatcgggaaggtggcagcctgaaccgcaccttcca 

aaccggcctgcccgtcggcacctactgcgatgtcattcacggcgatttcaatgccagcgccggcacctgttccggcccaactatcgctgtcaac 

ggctccggacaggcaaccatcacggtcaacgcgatggacgcggtggcgatctacggcggagccaggctcgccactccggccagtgtcaac 

gtgacattcaacgaaaacgccacgaccacctgggggcagaatgtgtatatcgtcggcaacgtcgccgccctgggcagctggaacgcaggca 

gcgcggtcttactctcctccgctaactacccaatctggagcaagaccatcgccctgccagccaacaccgccattgagtacaagtacatcaaaaa 

ggatggcgcgggcaatgtggtgtgggaaagcggcgccaaccgcgtctttaccacccccggcagcggcagtgccacgcgcaacgatacctg 

gaaatag 

SEQ ID NO: 124 

Met Pro Gin Ala He Arg Thr Phe Ser Arg Tip Thr Leu Phe Gly Leu He Gly Val Phe Leu Leu Gly 
Leu Val Phe Ser Val Pro Pro Arg Ala He Gin Ala Gin Thr Thr Pro Ala Arg Thr Val Met Val His 
Leu Phe Glu Trp Lys Trp Thr Asp He Ala Lys Glu Cys Glu Asn Phe Leu Gly Pro Lys Gly Phe Ala 
Ala He Gin Val Ser Pro Pro Gin Glu His Val Gin Gly Ser Gin Trp Trp Thr Arg Tyr Gin Pro Val Ser 
Tyr Lys He Glu Ser Arg Ser Gly Thr Arg Ala Glu Phe Ala Asn Met Val Ser Arg Cys Lys Ala Val 
Gly Val Asp He Tyr Val Asp Ala Val He Asn His Met Thr Thr Val Gly Ser Gly Thr Gly Met Ala 
Gly Ser Thr Tyr Thr Ser Tyr Thr Tyr Pro Gly Leu Tyr Gin Thr Gin Asp Phe His His Cys Gly Arg 
Asn Gly Asn Asp Asp He Ser Ser Tyr Gly Asp Arg Trp Glu Val Gin Asn Cys Glu Leu Leu Asn 
Leu Ala Asp Leu Asn Thr Gly Ala Glu Tyr Val Arg Gly Lys Leu Ala Ala Tyr Met Asn Asp Leu 
Arg Gly Leu Gly Val Ala Gly Phe Arg He Asp Ala Ala Lys His Met Asp Thr Asn Asp He Asn Asn 
He Val Gly Arg Leu Pro Asn Ala Pro Tyr He Tyr Gin Glu Val He Asp Gin Gly Gly Glu Pro He Thr 
Ala Gly Glu Tyr Phe Gin Asn Gly Asp Val Thr Glu Phe Lys Tyr Ser Arg Glu He Ser Arg Met Phe 
Lys Thr Gly Gin Leu Thr His Met Ser Gin Phe Gly Thr Ala Trp Gly Phe Met Ser Ser Asp Leu Ala 
Val Val Phe Thr Asp Asn His Asp Asn Gin Arg Gly His Gly Gly Ala Gly Asp Val Leu Thr Tyr 
Lys Asp Gly Gin Leu Tyr Thr Leu Gly Asn He Phe Glu Leu Ala Trp Pro Tyr Gly Tyr Pro Gin Val 
Met Ser Ser Tyr Thr Phe Ser Asn Gly Asp Gin Gly Pro Pro Ser Thr Asn Val Tyr Ala Thr Thr Thr 
Pro Asp Cys Gly Asn Gly Arg Trp Val Cys Glu His Arg Trp Arg Gly He Ala Asn Met Val Ala Phe 
Arg Asn Tyr Thr Ala Pro Thr Phe Ser Thr Ser Asn Trp Trp Ser Asn Gly Asn Asn Gin He Ala Phe 
Ser Arg Gly Thr Leu Gly Phe Val Ala He Asn Arg Glu Gly Gly Ser Leu Asn Arg Thr Phe Gin Thr 
Gly Leu Pro Val Gly Thr Tyr Cys Asp Val He His Gly Asp Phe Asn Ala Ser Ala Gly Thr Cys Ser 
Gly Pro Thr He Ala Val Asn Gly Ser Gly Gin Ala Thr He Thr Val Asn Ala Met Asp Ala Val Ala He 
Tyr Gly Gly Ala Arg Leu Ala Thr Pro Ala Ser Val Asn Val Thr Phe Asn Glu Asn Ala Thr Thr Thr 
Trp Gly Gin Asn Val Tyr He Val Gly Asn Val Ala Ala Leu Gly Ser Trp Asn Ala Gly Ser Ala Val 
Leu Leu Ser Ser Ala Asn Tyr Pro He Trp Ser Lys Thr He Ala Leu Pro Ala Asn Thr Ala He Glu Tyr 
Lys Tyr He Lys Lys Asp Gly Ala Gly Asn Val Val Trp Glu Ser Gly Ala Asn Arg Val Phe Thr Thr 
Pro Gly Ser Gly Ser Ala Thr Arg Asn Asp Thr Trp Lys 
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SEQIDNO:125 

gtggtgcacatgaagttgaagtaccttgccttagtmgttggctgtggcttcgataggcctactctcgactccagtgggtgctgccaagtactccg 

aactcgaagagggcggtgttataatgcaggccttctactgggatgttcccggagggggaatctggtgggacaccataagacagaaaatcccg 

gagtggtacgacgctggaatctcggcgatatggattcctccagctagcaaagggatgggcggtggttattccatgggctacgatccctacgattt 

ctttgacctcggcgagtactatcagaagggaacagttgagacgcgcttcggctcaaaggaggaactggtgaacatgataaacaccgcacactc 

ctatggcataaaggtgatagcggacatagtcataaaccaccgcgccggtggagaccttgagtggaacccctttgtaaacaactatacttggaca 

gacttctccaaggtcgcctccggtaaatacacggccaactaccttgacttccacccaaacgaggtcaagtgctgcgatgagggtacatttggtga 

ctttccggacatcgcccacgagaagagctgggatcagtactggctctgggcaagcaatgagagctacgccgcatatctccggagcataggga 

tcgatgcatggcgmcgactacgtcaaaggttacggagcgtgggttgttaatgactggctcagctggtggggaggctgggccgttggagagta 

ctgggacacgaacgttgatgcactccttaactgggcatacgacagcggtgccaaggtctttgacttcccgctctactacaagatggacgaagcc 

mgacaacaccaacatccccgctttggtttacgccctccagaacggaggaacagtcgtttcccgcgatcccttcaaggcagtaactttcgttgcc 

aaccacgatacagatataatctggaacaagtatccggcttatgcgttcatccttacctatgagggacagcctgttatattttaccgcgactocga^ 

agtggctcaacaaggataagcttaacaaccttatctggatacacgagcaccttgccggaggaagtaccaagatcctctactacgataacgatga 

gctaatattcatgagggagggctacgggagcaagccgggcctMtaacctacataaacctcggaaacgactgggccgagcg 

g^cggctcaaagmgccggctacacaatccatgaatacacaggcaatctcggtggctgggttgacaggtgggttcagtacgatggatgggtta 

aactgacggcacctcctcatgatccagccaacggatattacggctactcagtctggagctacgcaggcgtcggatga 

SEQIDNO: 126 

Val Val His Met Lys Leu Lys Tyr Leu Ala Leu Val Leu Leu Ala Val Ala Ser He Gly Leu Leu Ser 
Thr Pro Val Gly Ala Ala Lys Tyr Ser Glu Leu Glu Glu Gly Gly Val He Met Gin Ala Phe Tyr Tip 
Asp Val Pro Gly Gly Gly He Trp Trp Asp Thr He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He 
Ser Ala He Trp He Pro Pro Ala Ser Lys Gly Met Gly Gly Gly Tyr Ser Met Gly Tyr Asp Pro Tyr 
Asp Phe Phe Asp Leu Gly Glu Tyr Tyr Gin Lys Gly Thr Val Glu Thr Arg Phe Gly Ser Lys Glu Glu 
Leu Val Asn Met He Asn Thr Ala His Ser Tyr Gly He Lys Val He Ala Asp He Val He Asn His Arg 
Ala Gly Gly Asp Leu Glu Trp Asn Pro Phe Val Asn Asn Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala 
Ser Gly Lys Tyr Thr Ala Asn Tyr Leu Asp Phe His Pro Asn Glu Val Lys Cys Cys Asp Glu Gly Thr 
Phe Gly Asp Phe Pro Asp He Ala His Glu Lys Ser Tip Asp Gin Tyr Trp Leu Trp Ala Ser Asn Glu 
Ser Tyr Ala Ala Tyr Leu Arg Ser He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly 
Ala Trp Val Val Asn Asp Trp Leu Ser Tip Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp Asp Thr Asn 
Val Asp Ala Leu Leu Asn Trp Ala Tyr Asp Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys 
Met Asp Glu Ala Phe Asp Asn Thr Asn He Pro Ala Leu Val Tyr Ala Leu Gin Asn Gly Gly Thr Val 
Val Ser Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His Asp Thr Asp He He Trp Asn Lys 
Tyr Pro Ala Tyr Ala Phe He Leu Thr Tyr Glu Gly Gin Pro Val He Phe Tyr Arg Asp Tyr Glu Glu 
Trp Leu Asn Lys Asp Lys Leu Asn Asn Leu He Trp He His Glu His Leu Ala Gly Gly Ser Thr Lys 
He Leu Tyr Tyr Asp Asn Asp Glu Leu He Phe Met Arg Glu Gly Tyr Gly Ser Lys Pro Gly Leu He 
Thr Tyr He Asn Leu Gly Asn Asp Trp Ala Glu Arg Trp Val Asn Val Gly Ser Lys Phe Ala Gly Tyr 
Thr He His Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val Asp Arg Trp Val Gin Tyr Asp Gly Trp Val 
Lys Leu Thr Ala Pro Pro His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Tip Ser Tyr Ala Gly 
Val Gly 

SEQIDNO: 127 

gtgtgcatgaattatttgaaaaaagtgtggttgtattacgctatcgtcgctaccttaatcatttactttcttacgcccttttcaactgcacaagccaacac 

tgcaccagtcaacggaacgatgatgcaatatttcgaatgggatttaccgaatgatggcacactttggacgaaagtaaaaaacgaagcaagcagt 

ctttcttctttaggtattactgcgttatggttaccacctgcatacaaaggaacgagccaaggggatgtcgggtatggcgtgtacgatttgtatgactt 

aggagaatttaatcaaaaagggacgattcgaacgaaatacggaacaaaaacgcaatatttacaagccattcaagcggcaaaaagcgctggcat 

gcaagtatacgctgatgtcgtatttaatcacaaggcgggggcagatagtacagaatgggttgacgcagtcgaagtgaatccttctaatcgaaacc 

aagaaacatctggcacatatcaaattcaagcatggacaaaatttgatttccctggccgtgggaacacatactcaagctttaaatggcgatggtatc 

attttgacggtacggattgggatgaaagccgaaaactaaatcgtatttacaaatttcgtggcacaggaaaagcatgggattgggaagtagacaca 

gagaacggaaactatgactacttaatgtttgctgatttagatatggatcaccctgaagtcgtgacagagctaaaaaactggggaacatggtacgtc 
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aatacgacaaatgtcgatgggmcgcttagatgcagtaaagcatattaaatatagcttcttcccagattggttaacacatgtgcgttcacaaac 

aaaaaatcttmgcagtaggagaatmggagctacgatgtcaataaactgcataactacattacaaaaacaagtggaaccatg^^ 

cgccacttcataacaacttttacactgcttcaaaatctagcgggtatmgacatgcgctamgttaaataatacgngatgaaagaccagccttcto 

tgcggtcacactcgttgataatcatgacacgcaaccgggacaatctttacaatcatgggtagagccttggtttaagccgcttgcttatg^ 

gacaagacaagaaggatatccttgcgtattttacggcgactattacggcatccctaaatacaacattccgggattgaaaagtaaaatcgatccgct 

tctcattgcccgtagagactacgcatacggaacacaacgtgattatattgaccatcaagacattattggatggacacgggaaggaattgactcaa 

aaccgaactctggacttgcggctttaattactgacggccctggtggaagjaaatggatgtatgtaggtaaaaagcatgctggaa^ 

atctcactggaaatcgaagcgatacggtaacgattaatgcagacggctggggagagtttaaagtaaacggtggctccgtttccatttgggttgcc 

aaaacatcacaagtcacgtttaccgtcaacaatgcgacaacgataagcggacaaaatgtgtatgtcgttggtaacattccagagctcggaaattg 

gaacacagcaaacgcaatcaaaatgaccccatcttcttatccaacgtggaaagcaaccattgctcttccacaaggaaaagccattgaatttaaatt 

tattaaaaaagaccaatcgggaaatgngmgggaaagcattccaaaccgaacatacaccgnccattmatcaacaggctcatatacag 

ggaatgtaccttaa 

SEQIDNO: 128 

Val Cys Met Asn Tyr Leu Lys Lys Val Trp Leu Tyr Tyr Ala He Val Ala Thr Leu He He Tyr Phe 
Leu Thr Pro Phe Ser Thr Ala Gin Ala Asn Thr Ala Pro Val Asn Gly Thr Met Met Gin Tyr Phe Glu 
Trp Asp Leu Pro Asn Asp Gly Thr Leu Trp Thr Lys Val Lys Asn Glu Ala Ser Ser Leu Ser Ser Leu 
Gly He Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys Gly Thr Ser Gin Gly Asp Val Gly Tyr Gly Val 
Tyr Asp Leu Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr He Arg Thr Lys Tyr Gly Thr Lys Thr 
Gin Tyr Leu Gin Ala He Gin Ala Ala Lys Ser Ala Gly Met Gin Val Tyr Ala Asp Val Val Phe Asn 
His Lys Ala Gly Ala Asp Ser Thr Glu Tip Val Asp Ala Val Glu Val Asn Pro Ser Asn Arg Asn Gta 
Glu Thr Ser Gly Thr Tyr Gin He Gin Ala Trp Thr Lys Phe Asp Phe Pro Gly Arg Gly Asn Thr Tyr 
Ser Ser Phe Lys Trp Arg Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg 
He Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu Asn Gly Asn Tyr Asp 
Tyr Leu Met Phe Ala Asp Leu Asp Met Asp His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly 
Thr Tip Tyr Val Asn Thr Thr Asn Val Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys Tyr Ser 
Phe Phe Pro Asp Trp Leu Thr His Val Arg Ser Gin Thr Arg Lys Asn Leu Phe Ala Val Gly Glu Phe 
Trp Ser Tyr Asp Val Asn Lys Leu His Asn Tyr He Thr Lys Thr Ser Gly Thr Met Ser Leu Phe Asp 
Ala Pro Leu His Asn Asn Phe Tyr Thr Ala Ser Lys Ser Ser Gly Tyr Phe Asp Met Arg Tyr Leu Leu 
Asn Asn Thr Leu Met Lys Asp Gin Pro Ser Leu Ala Val Thr Leu Val Asp Asn His Asp Thr Gin 
Pro Gly Gin Ser Leu Gin Ser Trp Val Glu Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Thr 
Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp Tyr Tyr Gly He Pro Lys Tyr Asn He Pro Gly 
Leu Lys Ser Lys He Asp Pro Leu Leu He Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin Arg Asp Tyr 
He Asp His Gin Asp He He Gly Trp Thr Arg Glu Gly He Asp Ser Lys Pro Asn Ser Gly Leu Ala Ala 
Leu He Thr Asp Gly Pro Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Lys His Ala Gly Lys Val Phe 
Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr He Asn Ala Asp Gly Trp Gly Glu Phe Lys Val 
Asn Gly Gly Ser Val Ser He Trp Val Ala Lys Thr Ser Gin Val Thr Phe Thr Val Asn Asn Ala Thr 
Thr lie Ser Gly Gin Asn Val Tyr Val Val Gly Asn He Pro Glu Leu Gly Asn Trp Asn Thr Ala Asn 
Ala He Lys Met Thr Pro Ser Ser Tyr Pro Thr Trp Lys Ala Thr He Ala Leu Pro Gin Gly Lys Ala He 
Glu Phe Lys Phe He Lys Lys Asp Gin Ser Gly Asn Val Val Trp Glu Ser He Pro Asn Arg Thr Tyr 
Thr Val Pro Phe Leu Ser Thr Gly Ser Tyr Thr Ala Ser Trp Asn Val Pro 

SEQIDNO: 129 

ttgcgttgccgccgtggcagggacgggtgttggtgcgggcggcgtaatgcgctgccgcgacacccgcgtgaacaaaataatatgaattatttg 

aataggatgggggtgtcaagaatgacaaaatctcgagagttgcggtgttcatggaaagtatttgttgttgggtgcctgttgtggatggcttgggga 

tcttccgcgtccgccggcgtattgatgcaaggcttctactgggacgccagtaccgggaccagtgattcgtggtggacgcatttggccaagcaag 

ccaacggtctaaaacgggcggggttcaccgccgtatggattcctccggtgcttaaaggggcttcagggggctattccaacgggtacgatccctt 

tgacgactatgatatcggaagcaaggaccagaaaggtaccgtggcgacgcgatgggggacgcgagaagaactgcaacgtgccgtggccgt 

gatgcgcgcgaacggtctggatgtgtatgtggatctggtgctgaaccaccgcaacggggacgacgggaattggaattttcattacaaagatgc 

gtacggcaaagtgggttacgggcggtttcaaaaggggttttacgattttcaccccaactacaacattcaggatgccaatgttcccaacgaggattc 
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cagcttcgggcgcgamagcccatgacaatccgtatgtggccgatggactgaaggctgcaggcgattggctgaccaaagc^^^^^ 

gggatiatcgtctggattacgtgaaaggcatcagctacaccttcctgaaaagttatctgtcctatggggccatgaacggaaaatttgccgtcggtga 

gtactgggatgccaaccgggatacgttgaactggtgggcgaacacggcgatggaagggcgggcccatgtgtttgattttgcgttgcgcgagg 

agctgaaaaacatgtgcaatgcggacgggtactacgacatgcgtcgaUggaccacgcgggtctggtcggaatcgacccgtgg^ 

gacgtttgtcgaaaaccatgatacggatcggcacgaccccatctacaataacaagcatttggcgtatgcctacatcttgacgtcggaagggtatc 

cgacggtgttctggaaggattactaccaatacggaatgaagccgatcatcgacaacctcatttggatccacgaacacattgcgtacggaacgac 

ccaagagcgttggaaagacgaagatgtcmgtg^atgagcggaccggaggcaagcggctattggtggggcttaacgacaatcgcgcc^ 

gcaaaacggtcaccgtacagaccggctttggtgccaacgtggccttgcacgactacaccggcaacggccccgatctccgtaccgacgcctac 

ggtcgggtaaccttgaccattcctgcaaacgggtacgtggcctattccgttccgggcatctccggatcctttgtgccggtcgagaaaaccgtgac 

gcaggagmgccggggcgtccgacttggatattcgtccggccgataacacgcaamgtgcaggtcgggcggatatacgccaaggc^ 

gccggttacagcggaattgtattgggatgccaaagactggacgacctccacgtcgattctcctagaagtgcgttcggcttcgggaacgctcatc 

acgacaaagaccgtgacccaattgtcgtcccagggtacccgcgmccttcacgccttcggctaccggatggtacgtcttttccattcgaagctat 

aacacgccttcgacgaacccaaagccggcctactggttaaaggtaacgtatacggcgccgcaattgcttcagtaa 

SEQIDNO: 130 

Met Arg Cys Arg Arg Gly Arg Asp Gly Cys Trp Cys Gly Arg Arg Asn Ala Leu Pro Arg His Pro 
Arg Glu Gin Asn Asn Met Asn Tyr Leu Asn Arg Met Gly Val Ser Arg Met Thr Lys Ser Arg Glu 
Leu Arg Cys Ser Trp Lys Val Phe Val Val Gly Cys Leu Leu Trp Met Ala Trp Gly Ser Ser Ala Ser 
Ala Gly Val Leu Met Gin Gly Phe Tyr Trp Asp Ala Ser Thr Gly Thr Ser Asp Ser Trp Trp Thr His 
Leu Ala Lys Gin Ala Asn Gly Leu Lys Arg Ala Gly Phe Thr Ala Val Trp He Pro Pro Val Leu Lys 
Gly Ala Ser Gly Gly Tyr Ser Asn Gly Tyr Asp Pro Phe Asp Asp Tyr Asp He Gly Ser Lys Asp Ghi 
Lys Gly Thr Val Ala Thr Arg Tip Gly Thr Arg Glu Glu Leu Gin Arg Ala Val Ala Val Met Arg Ala 
Asn Gly Leu Asp Val Tyr Val Asp Leu Val Leu Asn His Arg Asn Gly Asp Asp Gly Asn Trp Asn 
Phe His Tyr Lys Asp Ala Tyr Gly Lys Val Gly Tyr Gly Arg Phe Gin Lys Gly Phe Tyr Asp Phe His 
Pro Asn Tyr Asn He Gin Asp Ala Asn Val Pro Asn Glu Asp Ser Ser Phe Gly Arg Asp Leu Ala His 
Asp Asn Pro Tyr Val Ala Asp Gly Leu Lys Ala Ala Gly Asp Trp Leu Thr Lys Ala Leu Asp Val 
Gin Gly Tyr Arg Leu Asp Tyr Val Lys Gly He Ser Tyr Thr Phe Leu Lys Ser Tyr Leu Ser Tyr Gly 
Ala Met Asn Gly Lys Phe Ala Val Gly Glu Tyr Trp Asp Ala Asn Arg Asp Thr Leu Asn Trp Trp 
Ala Asn Thr Ala Met Glu Gly Arg Ala His Val Phe Asp Phe Ala Leu Arg Glu Glu Leu Lys Asn 
Met Cys Asn Ala Asp Gly Tyr Tyr Asp Met Arg Arg Leu Asp His Ala Gly Leu Val Gly He Asp 
Pro Trp Lys Ala Val Thr Phe Val Glu Asn His Asp Thr Asp Arg His Asp Pro He Tyr Asn Asn Lys 
His Leu Ala Tyr Ala Tyr He Leu Thr Ser Glu Gly Tyr Pro Thr Val Phe Tip Lys Asp Tyr Tyr Gin 
Tyr Gly Met Lys Pro He He Asp Asn Leu He Tip He His Glu His He Ala Tyr Gly Thr Thr Gin Glu 
Arg Trp Lys Asp Glu Asp Val Phe Val Tyr Glu Arg Thr Gly Gly Lys Arg Leu Leu Val Gly Leu 
Asn Asp Asn Arg Ala Thf Ser Lys Thr Val Thr Val Gin Thr Gly Phe Gly Ala Asn Val Ala Leu His 
Asp Tyr Thr Gly Asn Gly Pro Asp Leu Arg Thr Asp Ala Tyr Gly Arg Val Thr Leu Thr He Pro Ala 
Asn Gly Tyr Val Ala Tyr Ser Val Pro Gly He Ser Gly Ser Phe Val Pro Val Glu Lys Thr Val Thr 
Gin Glu Phe Ala Gly Ala Ser Asp Leu Asp He Arg Pro Ala Asp Asn Thr Gin Phe Val Gin Val Gly 
Arg He Tyr Ala Lys Ala Asn Lys Pro Val Thr Ala Glu Leu Tyr Trp Asp Ala Lys Asp Trp Thr Thr 
Ser Thr Ser He Leu Leu Glu Val Arg Ser Ala Ser Gly Thr Leu He Thr Thr Lys Thr Val Thr Gin Leu 
Ser Ser Gin Gly Thr Arg Val Ser Phe Thr Pro Ser Ala Thr Gly Trp Tyr Val Phe Ser He Arg Ser Tyr 
Asn Thr Pro Ser Thr Asn Pro Lys Pro Ala Tyr Trp Leu Lys Val Thr Tyr Thr Ala Pro Gin Leu Leu 
Gin 

SEQroNO: 131 

atgccgcagctttacccattgccgccgcgctggcggcgcgcggcccggcagggcctggccgccttgacgctggccaccacggccctgggc 
atctcgacggcccaggcccagagtgcaccgcgcacggccttcgtgcatctgttcgaatggaagtggaccgacatcgcgcgcgagtgcgaga 
ccttcctcgggcccaagggcttcgcggcggtgcaggtgtcgcccccgaacgagcacaactgggtgaccagcggtgatggtgcaccttatccg 
tggtggatgcgctaccagccggtgagctacagcctggaccgcagccgcagcggcacgcgcgccgagttccaggacatggtcaaccgatgc 
aatgccgtgggcgtgggcatctacgtggacgccgtgatcaatcacatgtccggcggcacgggcggcacctcgagcgctgggcgcagctgg 
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agctatcacaactaccctgggctctatggccccaacgacttccaccagccggtgtgcagcatcaccaactacggggatgc^^ 

cgttgcgagctctcgggcttgcaggacctggacactgggagcgcttatgtgcgcggcaagatcgccgactatctggtggatctg^^ 

ggggtcaagggcttccgggtggatgcggccaagcacatcagcccgaccgacctgggcgccatcatcgatgcggtcaacagccgcaccggc 

gcgaaccgccctttctggtttctggaggtgattggcgcggccggcgaggcagtgcagccgaaccagtacttctcgctcggcggcggccaggt 

caccgtgaccgagttcaactatgggaagcaaatcttcggcaagttcgccggtggcggccgtctggccgagctgcgcagcttcggtgaaacctg 

gggcctgatgcccagcagcaaagcgattgctttcatcgacaaccacgacaagcagcgcggtcatggcggcggtggcaactatctgacctacc 

accatggctcgacctacgatctggccaacatcttcatgctggcttggccttatggctacccggcgctgatgto 

acggcctacgacacgagctttggcccgccacacgacagtggtggcgccacccgtggcccctgggatggtggcggcagccagccggcctgc 

ttcaaccagagcatcggtggctgggtgtgtgagcaccgctggcggggcatcgccaatatggtggccttccgcaacgccacgctgcccaactg 

gaccgtgaccgactggtgggacaacggcaacaaccagatcgcmcgggcggggtgacaagggcncgtggtgatcaaccgcgaagac^^ 

cgcgctgacgcgcaacttcaagaccagcctgccagccggccagtactgcgatgtcatctccggggacttcaacaatggtcagtgcacgggcc 

atgtggtgacggtcgatgccggcggctacgtgacgctgacggccgggcccaatggtgcggcggccatccacgtgggcgcccgtctggacg 

gcgcctctcagccgccgacgaccgcctcggtgacgttcaacgcgtcggccgatacctmggggacagaacctgttcgtcgtgggcaaccaca 

gcgcactgggcaactggtcgccggcggccgccaggccgatgacttggatttccggttcgggcacgcgcgggaactggcgcgcggtgctca 

atttgccggccaataccacctaccaatacaagttcatcaagaaggacggggctggaaacgtggtttgggagggcggtggcaatcgcgtcgtga 

ccacgccgtctgggggcggatcggtgagcacgggcggcaattggcagtag 

SEQIDNO:132 

Met Pro Gin Leu Tyr Pro Leu Pro Pro Arg Trp Arg Arg Ala Ala Arg Gin Gly Leu Ala Ala Leu Thr 
Leu Ala Thr Thr Ala Leu Gly He Ser Thr Ala Gin Ala Gin Ser Ala Pro Arg Thr Ala Phe Val His 
Leu Phe Glu Trp Lys Trp Thr Asp lie Ala Arg Glu Cys Glu Thr Phe Leu Gly Pro Lys Gly Phe Ala 
Ala Val Gin Val Ser Pro Pro Asn Glu His Asn Trp Val Thr Ser Gly Asp Gly Ala Pro Tyr Pro Trp 
Trp Met Arg Tyr Gin Pro Val Ser Tyr Ser Leu Asp Arg Ser Arg Ser Gly Thr Arg Ala Glu Phe Gin 
Asp Met Val Asn Arg Cys Asn Ala Val Gly Val Gly He Tyr Val Asp Ala Val He Asn His Met Ser 
Gly Gly Thr Gly Gly Thr Ser Ser Ala Gly Arg Ser Trp Ser Tyr His Asn Tyr Pro Gly Leu Tyr Gly 
Pro Asn Asp Phe His Gin Pro Val Cys Ser He Thr Asn Tyr Gly Asp Ala Asn Asn Val Gin Arg Cys 
Glu Leu Ser Gly Leu Gin Asp Leu Asp Thr Gly Ser Ala Tyr Val Arg Gly Lys He Ala Asp Tyr Leu 
Val Asp Leu Val Asn Met Gly Val Lys Gly Phe Arg Val Asp Ala Ala Lys His He Ser Pro Thr Asp 
Leu Gly Ala He He Asp Ala Val Asn Ser Arg Thr Gly Ala Asn Arg Pro Phe Trp Phe Leu Glu Val 
He Gly Ala Ala Gly Glu Ala Val Gin Pro Asn Gin Tyr Phe Ser Leu Gly Gly Gly Gin Val Thr Val 
Thr Glu Phe Asn Tyr Gly Lys Gin He Phe Gly Lys Phe Ala Gly Gly Gly Arg Leu Ala Glu Leu Arg 
Ser Phe Gly Glu Thr Trp Gly Leu Met Pro Sef Ser Lys Ala He Ala Phe He Asp Asn His Asp Lys 
Gin Arg Gly His Gly Gly Gly Gly Asn Tyr Leu Thr Tyr His His Gly Ser Thr Tyr Asp Leu Ala Asn 
He Phe Met Leu Ala Trp Pro Tyr Gly Tyr Pro Ala Leu Met Ser Ser Tyr Ala Phe Asn Arg Ser Thr 
Ala Tyr Asp Thr Ser Phe Gly Pro Pro His Asp Ser Gly Gly Ala Thr Arg Gly Pro Tip Asp Gly Gly 
Gly Ser Gin Pro Ala Cys Phe Asn Gin Ser He Gly Gly Trp Val Cys Glu His Arg Trp Arg Gly He 
Ala Asn Met Val Ala Phe Arg Asn Ala Thr Leu Pro Asn Trp Thr Val Thr Asp Trp Trp Asp Asn 
Gly Asn Asn Gb He Ala Phe Gly Arg Gly Asp Lys Gly Phe Val Val He Asn Arg Glu Asp Ala Ala 
Leu Thr Arg Asn Phe Lys Thr Ser Leu Pro Ala Gly Gin Tyr Cys Asp Val He Ser Gly Asp Phe Asn 
Asn Gly Gin Cys Thr Gly His Val Val Thr Val Asp Ala Gly Gly Tyr Val Thr Leu Thr Ala Gly Pro 
Asn Gly Ala Ala Ala He His Val Gly Ala Arg Leu Asp Gly Ala Ser Ghi Pro Pro Thr Thr Ala Ser 
Val Thr Phe Asn Ala Ser Ala Asp Thr Phe Trp Gly Gin Asn Leu Phe Val Val Gly Asn His Ser Ala 
Leu Gly Asn Trp Ser Pro Ala Ala Ala Arg Pro Met Thr Trp He Ser Gly Ser Gly Thr Arg Gly Asn 
Trp Arg Ala Val Leu Asn Leu Pro Ala Asn Thr Thr Tyr Gin Tyr Lys Phe He Lys Lys Asp Gly Ala 
Gly Asn Val Val Trp Glu Gly Gly Gly Asn Arg Val Val Thr Thr Pro Ser Gly Gly Gly Ser Val Ser 
Thr Gly Gly Asn Trp Gin 

SEQIDNO: 133 

atgaataatgtgaaaaaagtatggttgtattattctataattgctaccttagttatttcctttttcacacctttttcaacagcacaagctaatactgcacctg 
tcaacggaacaatgatgcaatatttcgaatgggatttaccgaatgatgggacgctttggacgaaagtaaaaaatgaagctaccaatctttcttcgct 
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aggtattacagcgttatggctccctccagcatataaaggaacgagccaaagcgatgtcggatatggcgtgtacgatt^ 

taatcaaaaagggacgatccgaacgaaatacggaacaaaagcacaatatattcaagccatccaagctgccaaagccgcagggatgcaagtat 

atgcagatgttgtatttaatcataaggcgggggctgacggcacagaatttgtcgatgcagttgaggtaaacccttctaatcgaaatcaagaaacat 

ctggcacatatcaaattcaagcatggacaaaamgatmcctggtcgtggaaacacatactccagcttcaaatggcgctggtatcatmgac^ 

accgattgggatgaaagtcgtaaattaaatcgtatttacaaattccgcggtacaggaaaagcgtgggactgggaagtcgatacagaaaacgga 

aactatgattatttaatgttcgctgatttagatatggatcaccctgaagttgtgacagagttaaaaaactggggaaaatggtatg^ 

tgtagacggamcgmggatgccgtaaaacatattaaatacagcttmccctgactggctaacatatgtacgtaatcaaaw 

gctgttggggaatmggagctatgacgtcaataagctgcataactacattacaaaaacaaatggatcgatgtcgttamgatg^ 

caacttttatatcgcnccaaatcgagtggatatmgacatgcgttatttangaataatacattaatgaaagatcaaccttca^ 

cgataaccatgatacacaaccaggtcaatctttacaatcatgggtagaagcttggtttaaaccgcttgcttacgcctttaU^ 

gggtatccttgcgtatmacggtgactattacggaatcccgaaatacaalaltccgggattaaaaagtaaaattgatccgctmaattgctcgtcgt 

gattatgcttatggaacacaacgtgattacattgatcatcaagacattatcggatggacacgagaaggcattgatgcaaaaccgaactctggactt 

gcggctttaattaccgacggccctggcggaagtaaatggatgtatgtcggtaaaaaacatgctgggaaagtgttttatgatttaactggaaatcga 

agtgacacagtaacgattaatgcggacggttggggagaatttaaagtaaacggcggctccgWcgamgggtggctaaaacatcaaacgtca 

catttacagtcaataacgccacaacaacaagtggacaaaacgtatatgttgttggcaacattccagagctaggcaattctttg 

SEQIDNO: 134 

Met Asn Asn Val Lys Lys Val Tip Leu Tyr Tyr Ser He He Ala Thr Leu Val He Ser Phe Phe Thr Pro 
Phe Ser Thr Ala Gin Ala Asn Thr Ala Pro Val Asn Gly Thr Met Met Gin Tyr Phe Glu Tip Asp Leu 
Pro Asn Asp Gly Thr Leu Trp Thr Lys Val Lys Asn Glu Ala Thr Asn Leu Ser Ser Leu Gly He Thr 
Ala Leu Tip Leu Pro Pro Ala Tyr Lys Gly Thr Ser Gin Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu 
Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr He Arg Thr Lys Tyr Gly Thr Lys Ala Gin Tyr He 
Gin Ala He Gin Ala Ala Lys Ala Ala Gly Met Gin Val Tyr Ala Asp Val Val Phe Asn His Lys Ala 
Gly Ala Asp Gly Thr Glu Phe Val Asp Ala Val Glu Val Asn Pro Ser Asn Arg Asn Gin Glu Thr Ser 
Gly Thr Tyr Gin He Gin Ala Trp Thr Lys Phe Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe 
Lys Trp Arg Trp Tyr His Phe Asp Gly Thr Asp Tip Asp Glu Ser Arg Lys Leu Asn Arg He Tyr Lys 
Phe Arg Gly Thr Gly Lys Ala Trp Asp Tip Glu Val Asp Thr Glu Asn Gly Asn Tyr Asp Tyr Leu 
Met Phe Ala Asp Leu Asp Met Asp His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp 
Tyr Val Asn Thr Thr Asn Val Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys Tyr Ser Phe Phe 
Pro Asp Trp Leu Thr Tyr Val Arg Asn Gin Thr Gly Lys Asn Leu Phe Ala Val Gly Glu Phe Tip Ser 
Tyr Asp Val Asn Lys Leu His Asn Tyr He Thr Lys Thr Asn Gly Ser Met Ser Leu Phe Asp Ala Pro 
Leu His Asn Asn Phe Tyr He Ala Ser Lys Ser Ser Gly Tyr Phe Asp Met Arg Tyr Leu Leu Asn Asn 
Thr Leu Met Lys Asp Gin Pro Ser Leu Ala Val Thr Leu Val Asp Asn His Asp Thr Gin Pro Gly Gin 
Ser Leu Gbi Ser Trp Val Glu Ala Tip Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Thr Arg Gin Glu 
Gly Tyr Pro Cys Val Phe Tyr Gly Asp Tyr Tyr Gly He Pro Lys Tyr Asn He Pro Gly Leu Lys Ser 
Lys He Asp Pro Leu Leu He Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin Arg Asp Tyr He Asp His 
Gin Asp He He Gly Trp Thr Arg Glu Gly He Asp Ala Lys Pro Asn Ser Gly Leu Ala Ala Leu He Thr 
Asp Gly Pro Gly Gly Ser Lys Tip Met Tyr Val Gly Lys Lys His Ala Gly Lys Val Phe Tyr Asp Leu 
Thr Gly Asn Arg Ser Asp Thr Val Thr He Asn Ala Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly 
Ser Val Ser He Trp Val Ala Lys Thr Ser Asn Val Thr Phe Thr Val Asn Asn Ala Thr Thr Thr Ser 
Gly Gin Asn Val Tyr Val Val Gly Asn He Pro Glu Leu Gly Asn Ser Leu . 

SEQroNO: 135 

gtgacaggcaccccgtctttatacattcctccacataaaataaccatacagctttcaaatttgttgaaatgtataaaaataaaaaatagtattgtaagc 

gttaacatccgtcattataataacttcaaacgcgtttatgttttaatgcaaacgtttgcatcctcattttatttaaagaaaggatgtgtgtgcatgaattatt 

tgaaaaaagtgtggttgtattacgctatcgtcgctaccttaatcatttcctttcttacgcccttttcaactgcacaagccaacactgcaccagtcaacg 

gaacgatgatgcaatatttcgaatgggatttaccgaatgatggcacactttggacgaaagtaaaaaacgaagcaagcagcctttcttctttaggtat 

tactgcgttatggttaccacctgcatacaaaggaacgagccaaggggatgtcgggtatggcgtgtacgatttgtatgacttaggagaatttaatca 

aaaagggacgattcgaacgaaatacggaacaaaaacgcaatatttacaagccattcaagcggcaaaaagcgctggcatgcaagtatacgctg 

atgtcgtatttaatcacaaggcgggggcagatagtacagaatgggttgacgcagtcgaagtgaatccttctaatcgaaaccaagaaacatctgg 
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cacatatcaaattcaagcatggacaaaatttgamccctgaccgtgggaacacatactcaagctttaaatggcgctggtatcatmgacggtac^ 

galtgggatgaaagtcgaaaactaaatcgcatttacaaiatttcgtggcacaggaaaagcatgggattgggaagtagacacagagaacggaaac 

tatgactacttaatgmgctgatttagatatggatcaccctgaagtcgtgacagagctaaaaaactggggaacatggtacgtcaatacgacaaat^ 

tcgatgggtttcgcttagatgcagtaaagcatattaaatatagctttttcccagattggttaacatatgtgcgctcacaaaca 

cagtaggagaatmggagctacgatgtcaataaactgcataactacattacaaaaacaagtggaaccatgtcgttaWgatgcgcca 

caacttttacactgcttcaaaatctagcgggtamtgacatgcgctamgttaaataatacgttgatgaaagaccagccttctcttgcggtca 

gttgataatcatgacacgcaaccgggacaatctttacaatcatgggtagagccttggtttaagccgcttgcttatgccmatmgacaagacM 

aggatatccttgcgtattttacggcgactattacggcatccctaaatacaatattccgggattgaaaagtaaaatcgatccgcttctcattgcccgta 

gagactacgcatacggaacacaacgtgattatattgaccatcaagacattattggatggacacgggaaggaattgactcaaaaccgaactctgg 

acttgcggctttaattactgacggtcctggtggaagtaaatggatgtatgtaggtaaaaagcatgctggaaaagtgttttacgate 

cgaagcgatacggtaacgattaatgcagacggctggggagagmaaagtaaacggtggctccgtttccatttgggttgccaaaacatcacaag 

tcacgtttaccgtcaacaatgcgacaacgacaagcggacaaaatgtgtatgtcgttggcaacattccagagctcggaaattggaacacagcaaa 

cgcaatc^aaatgaccccatcttcttatccaacgtggaaaacaaccat^ctcnccacaaggaaaagcaattggc^^ 

ga 

SEQIDNO: 136 

Val Thr Gly Thr Pro Ser Leu Tyr He Pro Pro His Lys He Thr He Gin Leu Ser Asn Leu Leu Lys Cys 
He Lys He Lys Asn Ser He Val Ser Val Asn He Arg His Tyr Asn Asn Phe Lys Arg Val Tyr Val Leu 
Met Gin Thr Phe Ala Ser Ser Phe Tyr Leu Lys Lys Gly Cys Val Cys Met Asn Tyr Leu Lys Lys Val 
Trp Leu Tyr Tyr Ala He Val Ala Thr Leu He He Ser Phe Leu Thr Pro Phe Ser Thr Ala Gin Ala Asn 
Thr Ala Pro Val Asn Gly Thr Met Met Gin Tyr Phe Glu Tip Asp Leu Pro Asn Asp Gly Thr Leu 
Trp Thr Lys Val Lys Asn Glu Ala Ser Ser Leu Ser Ser Leu Gly He Thr Ala Leu Trp Leu Pro Pro 
Ala Tyr Lys Gly Thr Ser Gin Gly Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
Asn Gin Lys Gly Thr He Arg Thr Lys Tyr Gly Thr Lys Thr Gin Tyr Leu Gin Ala He Gin Ala Ala 
Lys Ser Ala Gly Met Gin Val Tyr Ala Asp Val Val Phe Asn His Lys Ala Gly Ala Asp Ser Thr Glu 
Trp Val Asp Ala Val Glu Val Asn Pro Ser Asn Arg Asn Gin Glu Thr Ser Gly Thr Tyr Gin He Gin 
Ala Trp Thr Lys Phe Asp Phe Pro Asp Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Tip Tyr His 
Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg He Tyr Lys Phe Arg Gly Thr Gly Lys 
Ala Trp Asp Trp Glu Val Asp Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Phe Ala Asp Leu Asp 
Met Asp His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Thr Trp Tyr Val Asn Thr Thr Asn Val 
Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys Tyr Ser Phe Phe Pro Asp Trp Leu Thr Tyr Val 
Arg Ser Gin Thr Gin Lys Asn Leu Phe Ala Val Gly Glu Phe Trp Ser Tyr Asp Val Asn Lys Leu His 
Asn Tyr He Thr Lys Thr Ser Gly Thr Met Ser Leu Phe Asp Ala Pro Leu His Asn Asn Phe Tyr Thr 
Ala Ser Lys Ser Ser Gly Tyr Phe Asp Met Arg Tyr Leu Leu Asn Asn Thr Leu Met Lys Asp Gin 
Pro Ser Leu Ala Val Thr Leu Val Asp Asn His Asp Thr Gin Pro Gly Gin Ser Leu Gin Ser Trp Val 
Glu Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe 
Tyr Gly Asp Tyr Tyr Gly He Pro Lys Tyr Asn He Pro Gly Leu Lys Ser Lys He Asp Pro Leu Leu He 
Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin Arg Asp Tyr He Asp His Gin Asp He He Gly Trp Thr 
Arg Glu Gly He Asp Ser Lys Pro Asn Ser Gly Leu Ala Ala Leu He Thr Asp Gly Pro Gly Gly Ser 
Lys Trp Met Tyr Val Gly Lys Lys His Ala Gly Lys Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp 
Thr Val Thr lie Asn Ala Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser He Trp Val 
Ala Lys Thr Ser Gin Val Thr Phe Thr Val Asn Asn Ala Thr Thr Thr Ser Gly Gin Asn Val Tyr Val 
Val Gly Asn He Pro Glu Leu Gly Asn Trp Asn Thr Ala Asn Ala He Lys Met Thr Pro Ser Ser Tyr 
Pro Thr Trp Lys Thr Thr He Ala Leu Pro Gin Gly Lys Ala He Gly Gly Val Arg His Gly Pro 

SEQ ID NO: 137 

gtgggacgggcaggcttggcgcatcactcgaacacttccgccaaggggacatacgggtcacctctcgaactgcgtccggatcgccccgccgt 
ggccggggcggtcgagcttgaagatgtccagcggggagccgccgccgaggatcaccccggcggcgtactcgcccagggcggggctcag 
cttgaagccgtggccggagccgcctcccaggagccagacgttggaggcccgcggatggcggtcgaggaggaggtggccgtcggggctgt 
tctcgtactggcagacgcgggtctcgaccagcggcgcgtccttcagggccgggaaccggcgggccacctcggcccgggccgcttccagca 
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gggccggggtgatcgtccgctcgcccgccgtgggatcgatgggctcgccccgggtgtcg^ccgccaccttgaagccgcggtgcte 

ggggatgccgtagtagatccgctcgccgagatcgacccagaccggacagccgccctcctggaagcgcgggtcgcccggcggcgtgccga 

agaagaacacctcctggcgggtgttgcggaggaaccgctcaccgatcacgtccgggaacagcccggccagccagggaccgcaggcgaag 

acgtagaggtcggccgcgagagtggagccgtccgaaaggtgaagccgctccaagggccccgggaccatgg^ 

gccctcgccctggaacagctccaccacggtccggcaggcgcgccgggcgaacagggcgccggcttcctcctcgtaccagatcgtgcggac 

gccgtcgaaatcgacctgggggaagcggctccgggcctccccctgagacagctcggcgaccggcagccccgcgtcctccagaaaaggaa 

gggagtcgcggacgtagctgtcgtcctcgccgcacatccagaggaccccggtcctmgtacagccggtaaccggactgg 

gccagagctcgaaggagcgggcgacccactccacgtacagacggtcgggtccgtaggcgccgcggatgatccgcgtctcgccaccggag 

ctggagcgggagtgccccggaccccaggcgtccaggagggtcacccgggctccgcggcggaggagatgcagggcggtccagccgccg 

aaggcgccggcgccgacgacggcgaUtggggatgggagggcatggcgggcgtaaggttatcgcagcccgatccttcgctggc^^^ 

ctccgaccggagtatcctggaaaattcgaagaaggagatcgacatgcaatcgaacggaaacgtga 

SEQIDNO:138 

Val Gly Arg Ala Gly Leu Ala His His Ser Asn Thr Ser Ala Lys Gly Thr Tyr Gly Ser Pro Leu Glu 
Leu Arg Pro Asp Arg Pro Ala Val Ala Gly Ala Val Glu Leu Glu Asp Val Gin Arg Gly Ala Ala Ala 
Glu Asp His Pro Gly Gly Val Leu Ala Gin Gly Gly Ala Gin Leu Glu Ala Val Ala Gly Ala Ala Ser 
Gin Glu Pro Asp Val Gly Gly Pro Arg Met Ala Val Glu Glu Glu Val Ala Val Gly Ala Val Leu Val 
Leu Ala Asp Ala Gly Leu Asp Gin Arg Arg Val Leu Gin Gly Arg Glu Pro Ala Gly His Leu Gly 
Pro Gly Arg Phe Gin Gin Gly Arg Gly Asp Arg Pro Leu Ala Arg Arg Gly He Asp Gly Leu Ala Pro 
Gly Val Val Arg His Leu Glu Ala Ala Val Leu Val Ala Gly Asp Ala Val Val Asp Pro Leu Ala Glu 
He Asp Pro Asp Arg Thr Ala Ala Leu Leu Glu Ala Arg Val Ala Arg Arg Arg Ala Glu Glu Glu His 
Leu Leu Ala Gly Val Ala Glu Glu Pro Leu Thr Asp His Val Arg Glu Gin Pro Gly Gin Pro Gly Thr 
Ala Gly Glu Asp Val Glu Val Gly Arg Glu Ser Gly Ala Val Arg Lys Val Lys Pro Leu Gin Gly Pro 
Arg Asp His Gly Gly Leu Pro Val Leu Pro Ala Leu Ala Leu Glu Gin Leu His His Gly Pro Ala Gly 
Ala Pro Gly Glu Gin Gly Ala Gly Phe Leu Leu Val Pro Asp Arg Ala Asp Ala Val Glu He Asp Leu 
Gly Glu Ala Ala Pro Gly Leu Pro Leu Arg Gin Leu Gly Asp Arg Gin Pro Arg Val Leu Gin Lys 
Arg Lys Gly Val Ala Asp Val Ala Val Val Leu Ala Ala His Pro Glu Asp Pro Gly Pro Phe Val Gin 
Pro Val Thr Gly Leu Asp Phe Gly Val Pro Pro Glu Leu Glu Gly Ala Gly Asp Pro Leu His Val Gin 
Thr Val Gly Ser Val Gly Ala Ala Asp Asp Pro Arg Leu Ala Thr Gly Ala Gly Ala Gly Val Pro Arg 
Thr Pro Gly Val Gin Glu Gly His Pro Gly Ser Ala Ala Glu Glu Met Gin Gly Gly Pro Ala Ala Glu 
Gly Ala Gly Ala Asp Asp Gly Asp Met Gly Met Gly Gly His Gly Gly Arg Lys Val He Ala Ala Arg 
Ser Phe Ala Gly He Pro Ser Pro Thr Gly Val Ser Trp Lys He Arg Arg Arg Arg Ser Thr Cys Asn 
Arg Thr Glu Thr 

SEQroNO:139 

atgaaaacattcaaccttaaacccacacttttacctttaactttgctgctgagttcgccggtattggcggcacaaaatggaactatgatgcagtatttc 

cattggtatgtgccaaatgacggcgcactctggacacaagttgaaaacaatgcgccagcactatccgacaacggttttacagcgctgtggttgc 

caccagcatataaaggcgcaggtggtagcaacgacgttggttacggtgtttacgatatgtatgacttaggggagtttgatcaaaaaggatcggta 

cgaactaagtacggcaccaaagaccaatatctaaatgccatcaaagcagcacacaaaaacaatatccaaatttatggtgacgtagt 

tcgtggcggtgcagatggcaagtcgtgggtcgataccaagcgtgtggattggaataaccgcaatattgaacttggcgataaatggattgaagca 

tgggttgaatttagcttcccaggacgtaacgataaatactcagacttccattggacgtggtatcactttgatggcgtcgattgggatgacgcagg^^ 



gacttagacatggatcacccagaagtgaagcaagagctgaaagattggggtgaatggtacttaaacatgacgggtgttgatggcttccgaatgg 
atgcagtgaaacacatcaaatatcagtacctacaagagtggatcgattacttgcgtaagaaaacgggcaaagagctctttaccgttggtgagtac 
tggaactacgacgtgaacaatctgcacaactttatgactaagacttctggcagcatgtcattgtttgatgcgcctttacatatgaacttctataacgct 
tcacgctctggtggcaactttgatatgcgccgaatcatggatggcaccttgatgaaagacaacccagtgaaagcagtaacactggttgagaacc 
atgatacgcaaccactacaggccttagagtctccggtggattggtggttcaaaccacttgcgtacgcgttcattttgcttcgtgaggaaggttatcc 
gtcagtcttctacgcagattactacggtgcgcaatacagcgataaagggcacgatatcaacatggtgaaagtgccttacattgagcaattggtga 
aagcgcgtaaagattatgcttatggtaaacaacattcttaccttgaccactgggatgtgattggttggacacgagaaggggatgcggaacatccg 



\ 
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cgtaccgaagaagtatggactaacgctagtggatgggccgaancccagtgaacggcggatcggmctgmgggnggcgttaaato^ 
SEQIDNO:140 

Met Lys Thr Phe Asn Leu Lys Pro Thr Leu Leu Pro Leu Thr Leu Leu Leu Ser Ser Pro Val Leu Ala 
Ala Gin Asn Gly Thr Met Met Gin Tyr Phe His Trp Tyr Val Pro Asn Asp Gly Ala Leu Trp Thr Gin 
Val Glu Asn Asn Ala Pro Ala Leu Ser Asp Asn Gly Phe Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys 
Gly Ala Gly Gly Ser Asn Asp Val Gly Tyr Gly Val Tyr Asp Met Tyr Asp Leu Gly Glu Phe Asp 
Gin Lys Gly Ser Val Arg Thr Lys Tyr Gly Thr Lys Asp Gin Tyr Leu Asn Ala He Lys Ala Ala His 
Lys Asn Asn He Gin He Tyr Gly Asp Val Val Phe Asn His Arg Gly Gly Ala Asp Gly Lys Ser Tip 
Val Asp Thr Lys Arg Val Asp Trp Asn Asn Arg Asn He Glu Leu Gly Asp Lys Trp He Glu Ala Trp 
Val Glu Phe Ser Phe Pro Gly Arg Asn Asp Lys Tyr Ser Asp Phe His Trp Thr Trp Tyr His Phe Asp 
Gly Val Asp Trp Asp Asp Ala Gly Lys Glu Lys Ala He Phe Lys Phe Lys Gly Asp Gly Lys Ala Tip 
Asp Trp Glu Val Ser Ser Glu Lys Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp 
His Pro Glu Val Lys Gin Glu Leu Lys Asp Trp Gly Glu Tip Tyr Leu Asn Met Thr Gly Val Asp Gly 
Phe Arg Met Asp Ala Val Lys His He Lys Tyr Gin Tyr Leu Gin Glu Trp He Asp Tyr Leu Arg Lys 
Lys Thr Gly Lys Glu Leu Phe Thr Val Gly Glu Tyr Trp Asn Tyr Asp Val Asn Asn Leu His Asn 
Phe Met Thr Lys Thr Ser Gly Ser Met Ser Leu Phe Asp Ala Pro Leu His Met Asn Phe Tyr Asn Ala . 
Ser Arg Ser Gly Gly Asn Phe Asp Met Arg Arg He Met Asp Gly Thr Leu Met Lys Asp Asn Pro 
Val Lys Ala Val Thr Leu Val Glu Asn His Asp Thr Gin Pro Leu Gin Ala Leu Glu Ser Pro Val Asp 
Trp Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Leu Arg Glu Glu Gly Tyr Pro Ser Val Phe Tyr 
Ala Asp Tyr Tyr Gly Ala Gin Tyr Ser Asp Lys Gly His Asp He Asn Met Val Lys Val Pro Tyr He 
Glu Gin Leu Val Lys Ala Arg Lys Asp Tyr Ala Tyr Gly Lys Gin His Ser Tyr Leu Asp His Trp Asp 
Val He Gly Tip Thr Arg Glu Gly Asp Ala Glu His Pro Asn Ser Met Ala Val He Met Ser Asp Gly 
Pro Gly Gly Thr Lys Trp Met Tyr Thr Gly Ser Pro Ser Thr Arg Tyr Val Asp Lys Leu Gly He Arg 
Thr Glu Glu Val Tip Thr Asn Ala Ser Gly Trp Ala Glu Phe Pro Val Asn Gly Gly Ser Val Ser Val 
Trp Val Gly Val Lys 

SEQIDNO: 141 

atgaaaccaataaataccctactcatatccgcccttgctgtttgttctttcagttccgcgacttacgccgatactattttgcacgcgttcaatt^^ 

attcagatgtgacggccaacgcgaatcaaattgctcaagctggttataagaaagtgcttgttgcgcctgcaatgaaatcgagtggcagccaatgg 

tgggctcgctatcaacctcaagatctacgcactatcgattctcctttgggcaataaacaagatttagccgcaatgattgccgcactcaaaggtgtg 

ggcgtcgatgtgtatgccgatgtggtactcaaccatatggcgaatgaaagctggaagcgaagtgacttgaattaccctggcacagaagtgcta^ 

acgattatgctagccgttcaagctactatgctgaccagactctgtttggcaacctagcacaaggttatgtgtcagcgaacgactttcatccagcgg 

gctgtatttcagattggaacgaccctggtcatgttcagtattggcgtttgtgtggcgcagatggtgatgtaggtttacctgaccttgatccaaaca 

tgggtggtttcacaacagcgtttgtatctgaaagcgctaaaagatatgggcatcaaagggttccgaattgatgcagtgaagcacatgagccaat^ 

ccaaatcgatcaggtattcacgtctgaaattactgcgaacatgcatgtgtttggtgaagtgattactagcggtggagcagggaatagcggctatg 

aatcgttcttagcgccttacctgaataatactaatcactctgcctacgatttcccgctgtttgcatcgattcgctcggcattttctatggggggcggttt 

aaatcaactgcatgatcctaaagcgtacggtcaggcacttgatgataatcgctcgatcacctttgcgatcacacatgatattccaaccaatgacgg 

cttccgctaccaaattatggacccacaagacgagcagcttgcttacgcgtatatccttggtaaagacggtggcacgccgctgatctacagtgatg 

atcttcctgattctgaagacaaggataacggtcgttggggcaatgtttggaacagttcgacaatgaaaaacatgttgagcttccataacgcgatgc 

aaggcaaaacaatgacgatgatttctagcgaccattgcactttgttgtttaagcgtggcaaagaaggtgttgtgggtattaacaagtgtg^ 

cgcgtggcgtgacggttgatacctaccaacatgagtttaattggcatgttcaatacaaagacgtgttaagcagcgcaacagaaaccgtgacttct 

cgttaccatacgttcaatctaccaccacgcagtgcgcgtatgtttaagctgtag 

SEQ ID NO: 142 

Met Lys Pro He Asn Thr Leu Leu He Ser Ala Leu Ala Val Cys Ser Phe Ser Ser Ala Thr Tyr Ala 
Asp Thr He Leu His Ala Phe Asn Trp Lys Tyr Ser Asp Val Thr Ala Asn Ala Asn Gin He Ala Gin 
Ala Gly Tyr Lys Lys Val Leu Val Ala Pro Ala Met Lys Ser Ser Gly Ser Gin Trp Trp Ala Arg Tyr 
Gin Pro Gin Asp Leu Arg Thr He Asp Ser Pro Leu Gly Asn Lys Gin Asp Leu Ala Ala Met He Ala 
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Ala Leu Lys Gly Val Gly Val Asp Val Tyr Ala Asp Val Val Leu Asn His Met Ala Asn Glu Ser Trp 
Lys Arg Ser Asp Leu Asn Tyr Pro Gly Thr Glu Val Leu Asn Asp Tyr Ala Ser Arg Ser Ser Tyr Tyr 
Ala Asp Gin Thr Leu Phe Gly Asn Leu Ala Gin Gly Tyr Val Ser Ala Asn Asp Phe His Pro Ala Gly 
Cys He Ser Asp Trp Asn Asp Pro Gly His Val Gin Tyr Trp Arg Leu Cys Gly Ala Asp Gly Asp Val 
Gly Leu Pro Asp Leu Asp Pro Asn Asn Trp Val Val Ser Gin Gin Arg Leu Tyr Leu Lys Ala Leu 
Lys Asp Met Gly He Lys Gly Phe Arg He Asp Ala Val Lys His Met Ser Gin Tyr Gin He Asp Gin 
Val Phe Thr Ser Glu He Thr Ala Asn Met His Val Phe Gly Glu Val He Thr Ser Gly Gly Ala Gly 
Asn Ser Gly Tyr Glu Ser Phe Leu Ala Pro Tyr Leu Asn Asn Thr Asn His Ser Ala Tyr Asp Phe Pro 
Leu Phe Ala Ser He Arg Ser Ala Phe Ser Met Gly Gly Gly Leu Asn Gin Leu His Asp Pro Lys Ala 
Tyr Gly Gin Ala Leu Asp Asp Asn Arg Ser He Thr Phe Ala He Thr His Asp He Pro Thr Asn Asp 
Gly Phe Arg Tyr Gin He Met Asp Pro Gin Asp Glu Gin Leu Ala Tyr Ala Tyr He Leu Gly Lys Asp 
Gly Gly Thr Pro Leu He Tyr Ser Asp Asp Leu Pro Asp Ser Glu Asp Lys Asp Asn Gly Arg Trp Gly 
Asn Val Tip Asn Ser Ser Thr Met Lys Asn Met Leu Ser Phe His Asn Ala Met Gin Gly Lys Thr 
Met Thr Met He Ser Ser Asp His Cys Thr Leu Leu Phe Lys Arg Gly Lys Glu Gly Val Val Gly He 
Asn Lys Cys Gly Glu Thr Arg Gly Val Thr Val Asp Thr Tyr Gin His Glu Phe Asn Tip His Val Gin 
Tyr Lys Asp Val Leu Ser Ser Ala Thr Glu Thr Val Thr Ser Arg Tyr His Thr Phe Asn Leu Pro Pro 
Arg Ser Ala Arg Met Phe Lys Leu 

SEQIDNO: 143 

atgccaaagagcacttttaccaaatccataacaaaatcacttcttgctacttccgttgttgtaagcttattgcctgcctacgcacaggccgacactat 

cttgcatgcctttaactggaaatacagcgacattacccgccaagcagagcaaattgcgcaagctggttataaaaaagtactgatttcaccgccgc 

tgaagtccacaggcccacaatggtgggcacgttaccaaccacaggacattcgagtgattgactcccctgtcggcaacaagcaagatttacaag 

cQctcattgcagccttaaaggcacaaggcgttgaagtatacgcagacatcgtactcaaccacatggccaacgaaagctggaaacgagacgatc 

tgaactacccgggaagtgatttacttacccaatacagccaaaatatggcttacatgaaccagcaaaaattgtttggagatttagagcaaaatcag^ 

ctctgccaatgattttcacccggctggctgcattactgattggagtaacccggggcatgttcaatactggcgcttatgtggtggtaatggtgacact 

gggttacctgatcttgatcctaactcgtgggtgatcgatcaacaaaaacgttatttacgtgctttgaaagacatgggaataaagggcttccgagttg 

atgcggtaaaacacatgagcgattaccaaatcaaccaagtgtttacgccagacatcatcgcaggcttacatgtatttggtgaagtgatcaccagtg 

gtggcaagggcagcaatgactaccactcttttctggaaccgtatttaaataacaccaatcacgccgcgtatgacttcccgctatttgcctctatccg 

aaatgcatttagttatcatggcagcttgtctcaattacatgatccacaagcttacgggcaagcacttcctaacgacagagccattactttcaccatca 

ctcacgacattccaaccaatgatggtttccgttaccaaatcatggatccaaccagtgaaaaactcgcgtacgcgtacattctaggcaaagatggg 

ggtagcccacttatctatagcgatgctttagacccaagtgaagataaagataagggccgctggcgtgatgtatggaaccaagaatacatggttaa 

catgatcagcttccacaacaaggtgcaaggtaaaagcatggaggtcatgtacagcgatcaatgcttgctggtctttaaacgtgaaaaacaaggct 

tagtcggtattaataagtgcgctgaaagccgtacctacaccatagatacccatcgttttgaatttaactggtaccaac^gto 

ccagcacagcgagacctttagcagccgttatcatgctctgaccattccggcgcaaacagcacgaatgttggcgctataa 

SEQIDNO: 144 

Met Pro Lys Ser Thr Phe Thr Lys Ser He Thr Lys Ser Leu Leu Ala Thr Ser Val Val Val Ser Leu 
Leu Pro Ala Tyr Ala Gin Ala Asp Thr He Leu His Ala Phe Asn Trp Lys Tyr Ser Asp He Thr Arg 
Gin Ala Glu Gin He Ala Gin Ala Gly Tyr Lys Lys Val Leu He Ser Pro Pro Leu Lys Ser Thr Gly Pro 
Gin Trp Tip Ala Arg Tyr Gin Pro Gin Asp He Arg Val He Asp Ser Pro Val Gly Asn Lys Gin Asp 
Leu Gin Ala Leu He Ala Ala Leu Lys Ala Gin Gly Val Glu Val Tyr Ala Asp He Val Leu Asn His 
Met Ala Asn Glu Ser Trp Lys Arg Asp Asp Leu Asn Tyr Pro Gly Ser Asp Leu Leu Thr Gin Tyr Ser 
Gin Asn Met Ala Tyr Met Asn Gin Gin Lys Leu Phe Gly Asp Leu Glu Gin Asn Gin Phe Ser Ala 
Asn Asp Phe His Pro Ala Gly Cys He Thr Asp Trp Ser Asn Pro Gly His Val Gin Tyr Trp Arg Leu 
Cys Gly Gly Asn Gly Asp Thr Gly Leu Pro Asp Leu Asp Pro Asn Ser Trp Val He Asp Gin Gin Lys 
Arg Tyr Leu Arg Ala Leu Lys Asp Met Gly He Lys Gly Phe Arg Val Asp Ala Val Lys His Met Ser 
Asp Tyr Gin He Asn Gin Val Phe Thr Pro Asp He He Ala Gly Leu His Val Phe Gly Glu Val He Thr 
Ser Gly Gly Lys Gly Ser Asn Asp Tyr His Ser Phe Leu Glu Pro Tyr Leu Asn Asn Thr Asn His Ala 
Ala Tyr Asp Phe Pro Leu Phe Ala Ser He Arg Asn Ala Phe Ser Tyr His Gly Ser Leu Ser Gin Leu 
His Asp Pro Gin Ala Tyr Gly Gin Ala Leu Pro Asn Asp Arg Ala He Thr Phe Thr He Thr His Asp 
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He Pro Thr Asn Asp Gly Phe Arg Tyr Gin He Met Asp Pro Thr Ser Glu Lys Leu Ala Tyr Ala Tyr 
He Leu Gly Lys Asp Gly Gly Ser Pro Leu He Tyr Ser Asp Ala Leu Asp Pro Ser Glu Asp Lys Asp 
Lys Gly Arg Tip Arg Asp Val Trp Asn Gin Glu Tyr Met Val Asn Met He Ser Phe His Asn Lys Val 
Gin Gly Lys Ser Met Glu Val Met Tyr Ser Asp Gin Cys Leu Leu Val Phe Lys Arg Glu Lys Gin 
Gly Leu Val Gly He Asn Lys Cys Ala Glu Ser Arg Thr Tyr Thr He Asp Thr His Arg Phe Glu Phe 
Asn Tip Tyr Gin Pro Tyr Asn Asp Thr Leu Ser Gin His Ser Glu Thr Phe Ser Ser Arg Tyr His Ala 
Leu Thr He Pro Ala Gin Thr Ala Arg Met Leu Ala Leu 

SEQ ID NO: 145 

atgttgaaaaggattacggtagtctgtttattatttattttgctttttcctaatatatatgggaggaataaggcggaagcagcaacgataaataatgga 

acattaatgcagtattttgagtggtacgctccgaatgatgggaatcattggaatcgtttgcgttatgatgctgaaagtttagctcataagggaatcac 

atctgtatggataccacctgcatataaagggacttcgcaaaatgatgtagggtatggggcctatgatttatacgatttaggggagttcaatcaaaaa 

ggaacggtgcggacgaaatatgggacaaaggcacagttgaaatctgcaattgacgctttacataagcaaaacatcgacgtatacggtgatgta 

gttatgaatcataaaggtggggctgattatactgaaaccgtaacagctgttgaggtagaccgtaacaatcgaaatattgaagtatcaggtgattatg 

aaattagtgcgtggacgggttttaactttccagggcgcagagatgcttattctaatttcaaatggaaatggtatcattttgacggaacggattgggat 

gaaggaaggaaattaaaccgaatttataaatttaggggtataggtaaagcgtgggactgggaagtgtctagcgaaaatggaaattatgattatttg 

atgtatgcagatcttgattttgatcatccagatgttgcgaatgaaatgaaaagttggggaacgtggtatgcgaatgaattaaatttagatggatttcgt 

ttagatgctgttaaacatattgatcatgaatatttacgcgattgggtaaatcatgtcagacagcaaacggggaaagaaatgtttacggtggctgaat 

attggcaaaatgatatccagactttaaacaattatttggcgaaagtcaattataatcaatctgtatttgatgcaccgcttcattacaattttcattatgctt 

caacaggaaatgggaattatgatatgagaaatattttaaatggaacagtaatgaaaaatcatcctgcactcgcagttactctcgttgagaatcatga 

ttctcaacctgggcaatcattggaatctgtagtaagtccgtggtttaagccgctggcatatgcatttattttaactcgtgcagagggctatcctt^ 

tttttatggtgattactatgggacaagcggaaatagtagttatgaaattccagcgttaaaagataaaattgatccaattttgacggcacgaaaaaact 

ttgcatatggtacgcagcgtgattatttagaccatccagatgtgattggctggacaagagaaggagatagtgtacatgctaagtctggtttagcgg 

cattaatctccgatggaccaggaggatcaaagtggatggatgttggaaagaataacgctggggaagtatggtacgatattacgggtaatca^ 

aaatactgtaacaattaataaagatggatcggggcaattccatgtaagtggaggctctgtttctatatatgttcaacagtaa 

SEQ ID NO: 146 

Met Leu Lys Arg He Thr Val Val Cys Leu Leu Phe He Leu Leu Phe Pro Asn He Tyr Gly Arg Asn 
Lys Ala Glu Ala Ala Thr He Asn Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Ala Pro Asn Asp 
Gly Asn His Trp Asn Arg Leu Arg Tyr Asp Ala Glu Ser Leu Ala His Lys Gly He Thr Ser Val Tip 
He Pro Pro Ala Tyr Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu 
Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Leu Lys Ser Ala He 
Asp Ala Leu His Lys Gin Asn He Asp Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
Tyr Thr Glu Thr Val Thr Ala Val Glu Val Asp Arg Asn Asn Arg Asn lie Glu Val Ser Gly Asp Tyr 
Glu He Ser Ala Trp Thr Gly Phe Asn Phe Pro Gly Arg Arg Asp Ala Tyr Ser Asn Phe Lys Trp Lys 
Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Glu Gly Arg Lys Leu Asn Arg He Tyr Lys Phe Arg Gly 
lie Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp 
Leu Asp Phe Asp His Pro Asp Val Ala Asn Glu Met Lys Ser Trp Gly Thr Trp Tyr Ala Asn Glu Leu 
Asn Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His He Asp His Glu Tyr Leu Arg Asp Tip Val Asn 
His Val Arg Gin Gin Thr Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Trp Gin Asn Asp He Gin Thr 
Leu Asn Asn Tyr Leu Ala Lys Val Asn Tyr Asn Gin Ser Val Phe Asp Ala Pro Leu His Tyr Asn 
Phe His Tyr Ala Ser Thr Gly Asn Gly Asn Tyr Asp Met Arg Asn He Leu Asn Gly Thr Val Met Lys 
Asn His Pro Ala Leu Ala Val Thr Leu Val Glu Asn His Asp Ser Gin Pro Gly Gin Ser Leu Glu Ser 
Val Val Ser Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Thr Arg Ala Glu Gly Tyr Pro Ser 
Val Phe Tyr Gly Asp Tyr Tyr Gly Thr Ser Gly Asn Ser Ser Tyr Glu He Pro Ala Leu Lys Asp Lys 
He Asp Pro lie Leu Thr Ala Arg Lys Asn Phe Ala Tyr Gly Thr Gin Arg Asp Tyr Leu Asp His Pro 
Asp Val He Gly Trp Thr Arg Glu Gly Asp Ser Val His Ala Lys Ser Gly Leu Ala Ala Leu He Ser 
Asp Gly Pro Gly Gly Ser Lys Trp Met Asp Val Gly Lys Asn Asn Ala Gly Glu Val Trp Tyr Asp He 
Thr Gly Asn Gin Thr Asn Thr Val Thr He Asn Lys Asp Gly Ser Gly Gin Phe His Val Ser Gly Gly 
Ser Val Ser He Tyr Val Gin Gin 
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SEQIDNO: 147 

atgagcttaaataactttaaggtaaaactgcttag^mgctgtgtcttctgctgtattgtcactggctccaaatttagccaat^^ 

tgagatggtgataatccatccgmcagtggacatatgacaatatagcaaaagagtgtacagagtaccnggtccagccggam 

gamcccagccagcggaacataagcgggctgaaggagtatggtgggccgtatatcagccggttaattataagaattttacaaccatgac^ 

acgaggagcagcttaaggcaatgatcaagacctgtaatgatgcaggtgttaaggtgttcgctgacgctgttttcaaccaaaaggctacagacgg 

tgtaggctggggcggttcaacttggagltataagaactaccctgacggattctccggatcagamccatggagactgttccattgacaaaag 

actgatgcaaataatgtcagaacctgtgcactctcaggtatgccggacgttgccacagataactccgctactcaggaaaagattgcagattacct 

cgcttctttaatgaatatgggggtctatggtttccgtattgacgctgcaaagcacatgggatacaacgatatcaactccattctttcaaaaactgcac 

agaagactggaagaagacctcctgcatatctggaagtaatcggagccggtaacgaagctgccgacattcagccggacaagtatacctttattga 

gaatgcggttgtaactgacttcggttatgtctgggatgcaaatgagagmcggaaagggtaattacggtaaggcactggaactca^^ 

cggtgcaaattcagaaacattcgtaaacaatcatgatgatgaatggggcagatgctcagccggtagctgctcaatgaaaactcagaattatgctg 

attataatctggctcagtcctggcUgctgtatggcctgtaggtacagtaagacagatataftccggttaftcattccctgta 

tcgcgtcagtgatgcaactcatgatcagggcgggcctcttggtgccgaccgctgtgaaggtggctggttgtgtcagcaccgtgtgtccttc^^ 

caattccccaagatttgcgagagctaccagaggtactgctgtatcaaccaagggatttgacaatggtgctttgtggtttaacagaggaagcaaag 

gtttttatgcacagaatactaccaacagtcctataacccagacattctctgUgaagtacctgacggaaattactgtgatatcttaggaacatca 

cctaagagcaatccatgcggagcagacgttgtcgtaagcggcggtaaggctacctttactattcctgcaaagacagctgtggctatctgtacaga 

ctcagactggtgcggcaagggggttgatccttgtgaaagtgatccgaccggtgctgcctgtgmgtaagggggaaaccaccgttaatggtgtgt 

gcgtcagctggtgtaatgcgcattcatcaaatgaggaatgcacctgtgtattgaatccgaatgatgccaactgtcaggctgatattgaacctac^^ 

agggtaaactctgttacgccggtacncaaacgggtggaaacaggatccmaacatataaccgtaaaacaggtUctggactattaato 

gacggtgcaggtgataccagcggagctcagcgcttcaaggttacagacggatgttcatggaccggaacagtttacggncttcaggtactg^ 

gaaagttggatgtaaatacatcatcaaccggcgatgaacctgtgtctcttgttggtgattatgttcmccattaacgataagaccatggaatataca^ 

tcaccaaggcagatgaagtaactaatcagccaccggttgcatcatttaccgcgacagttaacggtctgaccgtttcttttgccaataattcatccga 

ccctgagaatgatgaattaacctacagctggaamcggtaatggtaaaacatcatccgagaaagctcctagcataacctatgaagaa^^ 

agtatactgttactttaaaggUactgancagctaataacactgatacamactaaagatataactgtaacagcaccttctagtggcaagtact^ 

ggngcagtcagaggncgcatgataattacggaactgatctgttaaccaagaacggnctgaUggaccggcgtcmgaattctttggatccacta 

gtgtcgacctgcaggcgcgcgagctc 

SEQ ID NO: 148 

Met Ser Leu Asn Asn Phe Lys Val Lys Leu Leu Ser Phe Ala Val Ser Ser Ala Val Leu Ser Leu Ala 
Pro Asn Leu Ala Asn Ala Ala Asn Phe Glu Ser Glu Met Val He He His Pro Phe Gin Trp Thr Tyr 
Asp Asn He Ala Lys Glu Cys Thr Glu Tyr Leu Gly Pro Ala Gly Phe Asp Gly Val Gin He Ser Gto 
Pro Ala Glu His Lys Arg Ala Glu Gly Val Trp Trp Ala Val Tyr Gin Pro Val Asn Tyr Lys Asn Phe 
Thr Thr Met Thr Gly Asn Glu Glu Gin Leu Lys Ala Met He Lys Thr Cys Asn Asp Ala Gly Val Lys 
Val Phe Ala Asp Ala Val Phe Asn Gin Lys Ala Thr Asp Gly Val Gly Trp Gly Gly Ser Thr Tip Ser 
Tyr Lys Asn Tyr Pro Asp Gly Phe Ser Gly Ser Asp Phe His Gly Asp Cys Ser He Asp Lys Ser Tyr 
Thr Asp Ala Asn Asn Val Arg Thr Cys Ala Leu Ser Gly Met Pro Asp Val Ala Thr Asp Asn Ser Ala 
Thr Gin Glu Lys He Ala Asp Tyr Leu Ala Ser Leu Met Asn Met Gly Val Tyr Gly Phe Arg He Asp 
Ala Ala Lys His Met Gly Tyr Asn Asp He Asn Ser He Leu Ser Lys Thr Ala Gin Lys Thr Gly Arg 
Arg Pro Pro Ala Tyr Leu Glu Val He Gly Ala Gly Asn Glu Ala Ala Asp He Gin Pro Asp Lys Tyr 
Thr Phe He Glu Asn Ala Val Val Thr Asp Phe Gly Tyr Val Trp Asp Ala Asn Glu Ser Phe Gly Lys 
Gly Asn Tyr Gly Lys Ala Leu Glu Leu Ser Thr Trp Leu Gly Ala Asn Ser Glu Thr Phe Val Asn Asn 
His Asp Asp Glu Trp Gly Arg Cys Ser Ala Gly Ser Cys Ser Met Lys Thr Gin Asn Tyr Ala Asp Tyr 
Asn Leu Ala Gin Ser Trp Leu Ala Val Trp Pro Val Gly Thr Val Arg Gin He Tyr Ser Gly Tyr Ser 
Phe Pro Val Lys Asp Asn Asp Pro Tyr Arg Val Ser Asp Ala Thr His Asp Gin Gly Gly Pro Leu Gly 
Ala Asp Arg Cys Glu Gly Gly Trp Leu Cys Gin His Arg Val Ser Phe Val Leu Asn Ser Pro Arg Phe 
Ala Arg Ala Thr Arg Gly Thr Ala Val Ser Thr Lys Gly Phe Asp Asn Gly Ala Leu Tip Phe Asn Arg 
Gly Ser Lys Gly Phe Tyr Ala Gin Asn Thr Thr Asn Ser Pro He Thr Gin Thr Phe Ser Val Glu Val 
Pro Asp Gly Asn Tyr Cys Asp He Leu Gly Thr Ser Asp Pro Lys Ser Asn Pro Cys Gly Ala Asp Val 
Val Val Ser Gly Gly Lys Ala Thr Phe Thr He Pro Ala Lys Thr Ala Val Ala He Cys Thr Asp Ser 
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Asp Trp Cys Gly Lys Gly Val Asp Pro Cys Glu Ser Asp Pro Thr Gly Ala Ala Cys Val Cys Lys Gly 
Glu Thr Thr Val Asn Gly Val Cys Val Ser Trp Cys Asn Ala His Ser Ser Asn Glu Glu Cys Thr Cys 
Val Leu Asn Pro Asn Asp Ala Asn Cys Gin Ala Asp lie Glu Pro Thr Lys Gly Lys Leu Cys Tyr Ala 
Gly Thr Ser Asn Gly Trp Lys Gin Asp Pro Leu Thr Tyr Asn Arg Lys Thr Gly Phe Trp Thr He Asn 
Leu Thr Leu Asp Gly Ala Gly Asp Thr Ser Gly Ala Gin Arg Phe Lys Val Thr Asp Gly Cys Ser Trp 
Thr Gly Thr Val Tyr Gly Ser Ser Gly Thr Ala Gly Lys Leu Asp Val Asn Thr Ser Ser Thr Gly Asp 
Glu Pro Val Ser Leu Val Gly Asp Tyr Val Leu Ser He Asn Asp Lys Thr Met Glu Tyr Thr Phe Thr 
Lys Ala Asp Glu Val Thr Asn Gin Pro Pro Val Ala Ser Phe Thr Ala Thr Val Asn Gly Leu Thr Val 
Ser Phe Ala Asn Asn Ser Ser Asp Pro Glu Asn Asp Glu Leu Thr Tyr Ser Trp Asn Phe Gly Asn Gly 
Lys Thr Ser Ser Glu Lys Ala Pro Ser He Thr Tyr Glu Glu Ser Gly Lys Tyr Thr Val Thr Leu Lys 
Val Thr Asp Ser Ala Asn Asn Thr Asp Thr Phe Thr Lys Asp He Thr Val Thr Ala Pro Ser Ser Gly 
Lys Tyr Leu Lys Val Ala Val Arg Gly Ser His Asp Asn Tyr Gly Thr Asp Leu Leu Thr Lys Asn Gly 
Ser Asp Trp Thr Gly Val Phe Glu Phe Phe Gly Ser Thr Ser Val Asp Leu Gin Ala Arg Glu Leu 

SEQIDNO: 149 

atgatcttaagtaattttaaggtaaaacUcttagttttgctgtgtcttctgctgtactgacactggctgcaaatgtcgccaatgc^ 

gtgaaatggttattattcatccatttcagtggacatatgacaatatagcaaaagaatgtactgagtatctgggacctgcgggatttgacggggtgca 

gamcccaggcggctgagcataaagatgccggtggtgcatggtggggtacctaccagcctgtaaacttcaagagttttactaccatggtt 

atgaagaacagcttagagcaatgattaaaacctgtaacgaggcaggtgttaaggtctttgccgatgccgtgattaatcagaaagccggcgacgg 

tgtaggtataggtggttcaacmcggaaattataattatcctgacggatttaccagtgatgatmcatcataataactgcagtataggtaa 

cagatgcatgggtagtaagattctgtgacctcagtggcatgccggatatagcaactgataacgacagtaccagaaataagattgctgattacttcg 

ccagccttatgaatatgggggtatacggattccgtattgatgctgccaagcactttagctatgatgatatagacgctattgtagagaaaacagcaa 

ccaaagcaggcaggagacctcctgtctatatggaggttatcggtaatccgggtcaagaggcggatgatatccagccgaacaagtatacatgga 

ttgataatgccgttgtaacagattttacttatgctaatagcatgcataatatttttaacggaagcggttatgcca^ 

atgttgatgctgaaaatgccgaagtctttataagtaatcatgataatgaatggggaagaaagtctgccggttcctgctcaataagaacccagaate 

atccggattaccatctggctcagtcctggctcgcagtnggcctttaggcaaggttagacagatttaUctgcatatcagUcccggtcm 

gttgtgagcgggtcagtcagcaagcccatgatcagggcggtcctatcggggcagcccgctgtgaagg^ggctgg^tgtgtcagca 

cgtttgtgctcaattctcctagatttgcaagagcaaccagagggacagtcgttactactaaaggttttgatgacggagctttgtggtttaacagagg 

aagcaagggcftctatgcccagaatactaccggcagftctataactcatacaUctcagngaattacctgatggaaattactgtgat^^ 

caaccgatccgaagaataatccttgcggagcggatgtcactgtaagcggaggtaaagcaacctttaccattccggcaaagaccgccgtagcte 

tctgtactgatgaaaagtggtgtggcaagggggttgacccttgtgaaagcgatcctaccggttccgcctgtgtatgtaagggtgaaaccacagtt 

aacggcgtatgtgtaagctggtgtaatgctcactcatctaatgaagaatgtgcctgtgtgctaaatcctaatgacgctgagtgtcag^^ 

gagccgaccaagggtaaactctgctatgmggtacctccaacaagtggactcaggaacctttaacctataatcgcaagaccggmctggac^^ 

caacgttgaacttgacggtaagggggataccagcggggcgcagcgctttaaagttaccgacggctgttcatggcagggtactgtttacggttca 

tcaggagtagaaggcagacttgacgtaaatacUcagccaccggagatgaaccggmcactgacaggtaaatatgttctttccataaatgataag 

accatggaatacacattcattcctgcaggcagtggaaacaagcctccggngcgtcatttactccgactgttaaagatetgactgtate 

taattcatccgaccctgagaatgatgaattaacctacagctggaatttcggtaacggtaaaacctcatctgaaaagaatccgagtgttacatatgat 

aaagccggtaaatatactgmcactcaaagtaaccgatactgcaaacaacactgataccaaaacactggaaatcgatttaacatctcctgttaacg 

gaaaatanccaaggttgcagtcagaggncacatgataactacggaacaaatctgttaaccaggaatggttcagaatgg^^ 

cagtaagacaaccaaattcaagcttgaagctctgcctcctgcagctgaccagtgtatcttcctcggcggtaatcgaggtgaggcattgactgcct 

ccggtggamatatctcttcctgccggaaggtatactataaagtttaatgaggaaagcaaggttcttactgcaggcgatgttgactgcaccggg 

SEQIDNO: 150 

Met He Leu Ser Asn Phe Lys Val Lys Leu Leu Ser Phe Ala Val Ser Ser Ala Val Leu Thr Leu Ala 
Ala Asn Val Ala Asn Ala Lys Asn Tyr Glu Ser Glu Met Val He He His Pro Phe Gin Trp Thr Tyr 
Asp Asn He Ala Lys Glu Cys Thr Glu Tyr Leu Gly Pro Ala Gly Phe Asp Gly Val Gin He Ser Gin 
Ala Ala Glu His Lys Asp Ala Gly Gly Ala Trp Trp Gly Thr Tyr Gin Pro Val Asn Phe Lys Ser Phe 
Thr Thr Met Val Gly Asn Glu Glu Gin Leu Arg Ala Met He Lys Thr Cys Asn Glu Ala Gly Val Lys 
Val Phe Ala Asp Ala Val He Asn Gin Lys Ala Gly Asp Gly Val Gly He Gly Gly Ser Thr Phe Gly 
Asn Tyr Asn Tyr Pro Asp Gly Phe Thr Ser Asp Asp Phe His His Asn Asn Cys Ser He Gly Asn Asn 
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Tyr Ser Asp Ala Trp Val Val Arg Phe Cys Asp Leu Ser Gly Met Pro Asp He Ala Thr Asp Asn Asp 
Ser Thr Arg Asn Lys He Ala Asp Tyr Phe Ala Ser Leu Met Asn Met Gly Val Tyr Gly Phe Arg He 
Asp Ala Ala Lys His Phe Ser Tyr Asp Asp He Asp Ala He Val Glu Lys Thr Ala Thr Lys Ala Gly 
Arg Arg Pro Pro Val Tyr Met Glu Val He Gly Asn Pro Gly Gin Glu Ala Asp Asp He Gin Pro Asn 
Lys Tyr Thr Trp He Asp Asn Ala Val Val Thr Asp Phe Thr Tyr Ala Asn Ser Met His Asn He Phe 
Asn Gly Ser Gly Tyr Ala Lys Ala Leu Asn Met Gly Leu Gly His Val Asp Ala Glu Asn Ala Glu Val 
Phe He Ser Asn His Asp Asn Glu Trp Gly Arg Lys Ser Ala Gly Ser Cys Ser He Arg Thr Gin Asn 
Asn Pro Asp Tyr His Leu Ala Gin Ser Trp Leu Ala Val Trp Pro Leu Gly Lys Val Arg Gin He Tyr 
Ser Ala Tyr Gin Phe Pro Val Phe Glu Asp Ser Cys Glu Arg Val Ser Gin Gin Ala His Asp Gin Gly 
Gly Pro He Gly Ala Ala Arg Cys Glu Gly Gly Trp Leu Cys Gin His Arg Val Pro Phe Val Leu Asn 
Ser Pro Arg Phe Ala Arg Ala Thr Arg Gly Thr Val Val Thr Thr Lys Gly Phe Asp Asp Gly Ala Leu 
Trp Phe Asn Arg Gly Ser Lys Gly Phe Tyr Ala Gin Asn Thr Thr Gly Ser Ser He Thr His Thr Phe 
Ser Val Glu Leu Pro Asp Gly Asn Tyr Cys Asp He Leu Gly Ala Thr Asp Pro Lys Asn Asn Pro Cys 
Gly Ala Asp Val Thr Val Ser Gly Gly Lys Ala Thr Phe Thr He Pro Ala Lys Thr Ala Val Ala He 
Cys Thr Asp Glu Lys Trp Cys Gly Lys Gly Val Asp Pro Cys Glu Ser Asp Pro Thr Gly Ser Ala Cys 
Val Cys Lys Gly Glu Thr Thr Val Asn Gly Val Cys Val Ser Trp Cys Asn Ala His Ser Ser Asn Glu 
Glu Cys Ala Cys Val Leu Asn Pro Asn Asp Ala Glu Cys Gin Ala Asp He Glu Pro Thr Lys Gly Lys 
Leu Cys Tyr Val Gly Thr Ser Asn Lys Trp Thr Gin Glu Pro Leu Thr Tyr Asn Arg Lys Thr Gly Phe 
Trp Thr Leu Asn Val Glu Leu Asp Gly Lys Gly Asp Thr Ser Gly Ala Gin Arg Phe Lys Val Thr 
Asp Gly Cys Ser Tip Gin Gly Thr Val Tyr Gly Ser Ser Gly Val Glu Gly Arg Leu Asp Val Asn Thr 
Ser Ala Thr Gly Asp Glu Pro Val Ser Leu Thr Gly Lys Tyr Val Leu Ser He Asn Asp Lys Thr Met 
Glu Tyr Thr Phe He Pro Ala Gly Ser Gly Asn Lys Pro Pro Val Ala Ser Phe Thr Pro Thr Val Lys 
Asp Leu Thr Val Ser Phe Val Asn Asn Ser Ser Asp Pro Glu Asn Asp Glu Leu Thr Tyr Ser Tip Asn 
Phe Gly Asn Gly Lys Thr Ser Ser Glu Lys Asn Pro Ser Val Thr Tyr Asp Lys Ala Gly Lys Tyr Thr 
Val Ser Leu Lys Val Thr Asp Thr Ala Asn Asn Thr Asp Thr Lys Thr Leu Glu He Asp Leu Thr Ser 
Pro Val Asn Gly Lys Tyr Ser Lys Val Ala Val Arg Gly Ser His Asp Asn Tyr Gly Thr Asn Leu Leu 
Thr Arg Asn Gly Ser Glu Trp Thr Gly He Phe Glu Phe Ser Lys Thr Thr Lys Phe Lys Leu Glu Ala 
Leu Pro Pro Ala Ala Asp Gin Cys He Phe Leu Gly Gly Asn Arg Gly Glu Ala Leu Thr Ala Ser Gly 
Gly Phe He Ser Leu Pro Ala Gly Arg Tyr Thr He Lys Phe Asn Glu Glu Ser Lys Val Leu Thr Ala 
Gly Asp Val Asp Cys Thr Gly 

SEQIDNO: 151 

atgaaaactattctttcaacaatcatggtgatggcggctgcggctgccaccaccgtagaggctcaaggctggccggaaaactacggcggcgtc 

atgttgcagggattctactgggattcctattcagccaccaagtggactaaactggaagcacaggctgacgagatctgcaactatttctcgctggta 

tgggtaccacagtcggcctataccggcagcagtacctccatgggctacgacccgctgtattacttcgaccagcattcatcgttcggcaccgaag 

agcagctacggtcgttcatcagtacctacaagcagaaaggaactggcatcatagccgatgtagttgtcaatcaccgaaagaatgtctcaaactg 

ggtggatttcccggccgagacctacaacggtgtaacctatcagatggtaagcaccgacatcgtttcgaacgatgacggcggaaaaacagcca 

cttgggcaaatcaaaacggctacagtctctcctccaatgccgacgaaggcgaaggctgggacggcatgcgcgacctggaccacaagtcgca 

gaacgtgcagaaatcggttcttgcctacaccaaatatctggttgacgacttaggctataccggattccgctacgatatggtaaagggatttgacgg 

atcgcatgtagccgactacaacaccaatgccggcgtgcagttctctgtcggcgaatattgggacggcactgcatcgaaagtttacagttggatca 

acagcaccaaaaagagcgatgtgccgcagtcggcagccttcgacttcgctttccgatacacctgccgcgatgccgtcaacaacaagaactgg 

gcgaacctgaagaacacttccggtatcagcgatgccgattacaggcgctattcggttacgtttgttgaaaatcacgatacggaataccgttcagct 

acggcttcccaggatcccatcaagggtgatacggttgccctcaatgcctggatgctggctatgccgggcacaccttgtgttttcctgaaacattgg 

accgactgcaaggaagagatcaagaatctcatcgaggcacgtcgcctggtcggtattcacaaccagagcacctatgccgaatggatgagcgg 

tgcagcctacatcggacgtaccgtaacaggtacgaacggcaccttacgtgttctgtgcggctcttatcagtataatgtagccgccaactacattca 

gattctctcaggcaaaaactataaatactacgtactcaacacgctcgaggctccctggatcgggaaaggttccggctcgtacaccgaaggtgaa 

accgtaacggttccgctcatcgccatatcggccgatgccaatgccaagctggtatataccaccgacggcacagaccccaccgcaacctcaaca 

gccgtaaccagcggaacggaactgaccatcacttcggacgccgtcctgaaggttggtctgctttccggcggcatcgtcaggaacatacagagc 

cgtacattcaccttccaggctgcaaacacctccgagtattacacagccaccatgcacgtatgcaaccagtccggagctctcaatccgctgtttgc 

ctatgtttgggcaggaccggacaacgagcagattaacggcaactggccgggcaccaagctcaccgctaccattaccgaaaacaaccttacct 
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ggtacacgcagtcgttccagattccgaagaacgtggactatgtcgtgaactttgttttcaccacaaccggcggcggtacgcagacagtggatgtt 
accggcatgaaggccgatgtctggtacattattaacag^accaagagcggcaacaagtacacggtaaccgacgttacctcacagtattcttcgtt 
agaggccatcmgatgaagaaaactccggctcxttccctgtctatgacctgcagggacgccgcgtcagcgaaattagaaacaggacaattatat 
cttcagaacggaaagaagatactcatcagataaacagaggttccgaaccattctcctattatgaaaatcagacacttagtaatctcagcactgc^^ 
ggtttggggggcttgtacaccatcagctgctcctcgtcggg 

SEQIDNO: 152 

Met Lys Thr He Leu Ser Thr He Met Val Met Ala Ala Ala Ala Ala Thr Thr Val Glu Ala Gin Gly 
Trp Pro Glu Asn Tyr Gly Gly Val Met Leu Gin Gly Phe Tyr Tip Asp Ser Tyr Ser Ala Thr Lys Trp 
Thr Lys Leu Glu Ala Gin Ala Asp Glu He Cys Asn Tyr Phe Ser Leu Val Trp Val Pro Gin Ser Ala 
Tyr Thr Gly Ser Ser Thr Ser Met Gly Tyr Asp Pro Leu Tyr Tyr Phe Asp Gin His Ser Ser Phe Gly 
Thr Glu Glu Gin Leu Arg Ser Phe lie Ser Thr Tyr Lys Gin Lys Gly Thr Gly He He Ala Asp Val Val 
Val Asn His Arg Lys Asn Val Ser Asn Tip Val Asp Phe Pro Ala Glu Thr Tyr Asn Gly Val Thr Tyr 
Gin Met Val Ser Thr Asp He Val Ser Asn Asp Asp Gly Gly Lys Thr Ala Thr Trp Ala Asn Gin Asn 
Gly Tyr Ser Leu Ser Ser Asn Ala Asp Glu Gly Glu Gly Tip Asp Gly Met Arg Asp Leu Asp His 
Lys Ser Gin Asn Val Gin Lys Ser Val Leu Ala Tyr Thr Lys Tyr Leu Val Asp Asp Leu Gly Tyr Thr 
Gly Phe Arg Tyr Asp Met Val Lys Gly Phe Asp Gly Ser His Val Ala Asp Tyr Asn Thr Asn Ala 
Gly Val Gin Phe Ser Val Gly Glu Tyr Trp Asp Gly Thr Ala Ser Lys Val Tyr Ser Tip He Asn Ser 
Thr Lys Lys Ser Asp Val Pro Gin Ser Ala Ala Phe Asp Phe Ala Phe Arg Tyr Thr Cys Arg Asp Ala 
Val Asn Asn Lys Asn Trp Ala Asn Leu Lys Asn Thr Ser Gly He Ser Asp Ala Asp Tyr Arg Arg Tyr 
Ser Val Thr Phe Val Glu Asn His Asp Thr Glu Tyr Arg Ser Ala Thr Ala Ser Gin Asp Pro He Lys 
Gly Asp Thr Val Ala Leu Asn Ala Tip Met Leu Ala Met Pro Gly Thr Pro Cys Val Phe Leu Lys His 
Trp Thr Asp Cys Lys Glu Glu He Lys Asn Leu He Glu Ala Arg Arg Leu Val Gly He His Asn Gin 
Ser Thr Tyr Ala Glu Tip Met Ser Gly Ala Ala Tyr He Gly Arg Thr Val Thr Gly Thr Asn Gly Thr 
Leu Arg Val Leu Cys Gly Ser Tyr Gin Tyr Asn Val Ala Ala Asn Tyr He Gin He Leu Ser Gly Lys 
Asn Tyr Lys Tyr Tyr Val Leu Asn Thr Leu Glu Ala Pro Trp He Gly Lys Gly Ser Gly Ser Tyr Thr 
Glu Gly Glu Thr Val Thr Val Pro Leu He Ala He Ser Ala Asp Ala Asn Ala Lys Leu Val Tyr Thr 
Thr Asp Gly Thr Asp Pro Thr Ala Thr Ser Thr Ala Val Thr Ser Gly Thr Glu Leu Thr He Thr Ser 
Asp Ala Val Leu Lys Val Gly Leu Leu Ser Gly Gly He Val Arg Asn He Gin Ser Arg Thr Phe Thr 
Phe Gin Ala Ala Asn Thr Ser Glu Tyr Tyr Thr Ala Thr Met His Val Cys Asn Ghi Ser Gly Ala Leu 
Asn Pro Leu Phe Ala Tyr Val Trp Ala Gly Pro Asp Asn Glu Gin He Asn Gly Asn Tip Pro Gly Thr 
Lys Leu Thr Ala Thr He Thr Glu Asn Asn Leu Thr Trp Tyr Thr Gin Ser Phe Gin He Pro Lys Asn 
Val Asp Tyr Val Val Asn Phe Val Phe Thr Thr Thr Gly Gly Gly Thr Gin Thr Val Asp Val Thr Gly 
Met Lys Ala Asp Val Tip Tyr He He Asn Ser Thr Lys Ser Gly Asn Lys Tyr Thr Val Thr Asp Val 
Thr Ser Gin Tyr Ser Ser Leu Glu Ala He Phe Asp Glu Glu Asn Ser Gly Ser Phe Pro Val Tyr Asp 
Leu Gin Gly Arg Arg Val Ser Glu He Arg Asn Arg Thr He He Ser Ser Glu Arg Lys Glu Asp Thr 
His Gin He Asn Arg Gly Ser Glu Pro Phe Ser Tyr Tyr Glu Asn Gin Thr Leu Ser Asn Leu Ser Thr 
Ala Gly Phe Gly Gly Leu Val His His Gin Leu Leu Leu Val Gly 

SEQroNO: 69 

atgttgaaaaggattacggtagtctgtttattgtttattttgctttttcctaatatatatgagggaaataaggcagaagcagcaacagtgaacaatgga 

acattaatgcagtattttgagtggtacgctccgaatgatgggaatcattggaatcgtttgcgttccgatgctgaaagtttagctcataaaggaatcac 

atctgtatggataccacctgcatataaagggacttcgcaaaatgatgtagggtatggggcctatgatttatatgatttaggggagttcaatcaaaaa 

ggaacggtgcggacgaaatatgggacaaaagcacagttgaaatctgcaattgacgctttacataagcaaaacatcgacgtatacggtgatgtag 

ttatgaatcataaaggtggggctgattatactgaaaccgtaacagctgttgaggtagaccgtaacaatcgaaatattgaagtatcaggtgattatca 

aattagtgcatggacggggtttaattttccagggcgcggagatgcttattctaatttcaaatggaaatggtatcattttgacggaacggattgggatg 

aaggaaggaaattaaatcgaatttataaatttaggggtgtagataaagcgtgggattgggaagtgtctagcgaaaatggaaattatgattatttgat 

gtatgcagatcttgattttgatcatcctgatgttgcgaatgagatgaaaaattggggaacatggtatgcgaatgaattaaatttagatggctttcgttt 

ggacgctgttaaacatattgatcatgaatatttacgcgattgggtaaatcatgccagacagcaaacggggaaagaaatgtttacagtagctgaata 

ttggcaaaatgatgttcaggctttaaacaattatttagcgaaagtcaattataatcaatctg^gtttgatgcaccgcttcattacaattttcattatgcttc 
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aacaggaaatgggaattatgatatgagaaatattttaaatggaacagtaatgaaaaatcaccctgcactcgcagttactctc^ 

tctcagcctgggcagtcattggaatctgtagtaagtccgtggtttaagccgctggcatatgcatttattttaactcgtg^ 

ttctatggtgattactatgggacaagcggaaatagtagttatgaaattccagcgttaaaagataaaattgatccaattttgacggcacgaaaaaactt 

tgcatatggtacgcagcgtgattamagaccatccagatgtgattggctggacaagagaaggcgatggtgtacatgctaattctggtto 

attactctcggacggaccaggaggatcaaagtggatggatgttggaaagaataacgctggggaagtatggtacgatattacgggtaatcaa^ 

aaatactgtaacaattaataaggacggatgggggcagttctatgtaagtggcggctcagtttccatatatgttcagcggtaa 

SEQIDNO:70 

Met Leu Lys Arg He Thr Val Val Cys Leu Leu Phe He Leu Leu Phe Pro Asn He Tyr Glu Gly Asn 
Lys Ala Glu Ala Ala Thr Val Asn Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Ala Pro Asn Asp 
Gly Asn His Trp Asn Arg Leu Arg Ser Asp Ala Glu Ser Leu Ala His Lys Gly He Thr Ser Val Tip 
He Pro Pro Ala Tyr Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu 
Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Leu Lys Ser Ala He 
Asp Ala Leu His Lys Gin Asn He Asp Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
Tyr Thr Glu Thr Val Thr Ala Val Glu Val Asp Arg Asn Asn Arg Asn He Glu Val Ser Gly Asp Tyr 
Gin He Ser Ala Trp Thr Gly Phe Asn Phe Pro Gly Arg Gly Asp Ala Tyr Ser Asn Phe Lys Tip Lys 
Trp Tyr His Phe Asp Gly Thr Asp Tip Asp Glu Gly Arg Lys Leu Asn Arg He Tyr Lys Phe Arg Gly 
Val Asp Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp 
Leu Asp Phe Asp His Pro Asp Val Ala Asn Glu Met Lys Asn Trp Gly Thr Tip Tyr Ala Asn Glu 
Leu Asn Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His He Asp His Glu Tyr Leu Arg Asp Trp Val 
Asn His Ala Arg Gin Gin Thr Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Tip Gin Asn Asp Val Gin 
Ala Leu Asn Asn Tyr Leu Ala Lys Val Asn Tyr Asn Gin Ser Val Phe Asp Ala Pro Leu His Tyr Asn 
Phe His Tyr Ala Ser Thr Gly Asn Gly Asn Tyr Asp Met Arg Asn lie Leu Asn Gly Thr Val Met Lys 
Asn His Pro Ala Leu Ala Val Thr Leu Val Glu Asn His Asp Ser Gin Pro Gly Gin Ser Leu Glu Ser 
Val Val Ser Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Thr Arg Ala Glu Gly Tyr Pro Ser 
Val Phe Tyr Gly Asp Tyr Tyr Gly Thr Ser Gly Asn Ser Ser Tyr Glu He Pro Ala Leu Lys Asp Lys 
He Asp Pro He Leu Thr Ala Arg Lys Asn Phe Ala Tyr Gly Thr Gin Arg Asp Tyr Leu Asp His Pro 
Asp Val He Gly Trp Thr Arg Glu Gly Asp Gly Val His Ala Asn Ser Gly Leu Ala Thr Leu Leu Ser 
Asp Gly Pro Gly Gly Ser Lys Trp Met Asp Val Gly Lys Asn Asn Ala Gly Glu Val Tip Tyr Asp He 
Thr Gly Asn Gin Thr Asn Thr Val Thr He Asn Lys Asp Gly Tip Gly Gin Phe Tyr Val Ser Gly Gly 
Ser Val Ser He Tyr Val Gin Arg 

SEQID NO: 153 

ttgccttcaattaatgcaagcgattgcaaaaaaaagggagataggagtatgaagaggaaaaaatggactgcgttagcactatctttaccactagtt 

atgagcttatcaacaaacatacaagcagaaacattacataataataagggtcaaaaggcgcaaacaggaaataaagacggaattttttatgaact 

gtatgttaattctttttatgatactgatagcaatggacatggtgatttaaaaggcgtcacaaagaaacttgattatttaaatgatggaaatccaagaac 

aaataatgatcttcaaataaacggtatctggatgatgcctattaacacctctcctagttatcacaaatatgatgtaacagattactataatatcgatcct 

cagtatggaagtttacaagatttccgtgaactaacaacagaagcgcataaacgcaacgtaaaggtagtaatagatcttgttattaatcatacaagc 

agtgagcatccttggtttgtcgatgcattaaaaaataaaaacagtaagtatcgagattactatatttgggctgataaaaatacagacttaaatgaaaa 

aggcccatggggtcaacaagtatggcacaaagcgtcgaacggagagtatttctacgcaacgttctgggaagggatgccggacttaaactatga 

caaccctaaagtaagagaagaaatgattaaaatcgggaaattttggctcaaacaaggagctgatggctttcgtctagatgcagccatgcacatctt 

taaagggcaaacacctgaaggagcaaagaaaaatattgaatggtggaatgaattccgcgacgcgatgagagaaacgaatccaaatacgtatct 

agttggtgaaatatgggatcaaccagaagtagttgctccgtattatcaatcgttagattctacatttaacttcgacttagcatataaaatcgt^ 

gttaaaaatggtactgatcaaggggtagccgcggcagctgttgcaacggatgagttatataaaacatataatccaaataaaattgatggaacgttt 

ttaacgaatcatgaccaaaatcgtgtaatgagtgagttaaatggtgatgtaaacaaagcaaaatcagcagcctctattctgttgacactccctggta 

atccgttcatttattatggcgaagaaatcggcatgacaggccaaaaaccagatgagttgattcgtgagcctttccgttggtatgaagatgataaag 

aaggtcaaacgagctgggagactccagtatataacattgatcataatggtgtttcagttgaagcacaagataaacaaaaagcttctcttctaagcc 

attatcgtaaaatgattcgtgttcgtcagcaacacgatgaacttgtcaaaggtaatttagaacctatttctgtcaataattcacaggttgttgcctataat 

cgtacgtataaaaataaatcaattcaagtgtaccataatatttcagacaagccggttacattaactgtttcaaacaaaggaaaactgattttttctagt 

gaattaggagcaaaaaaggaaaaatcaacattagtaattccagcgaatacgacagtgctagtaaagtaa 



wo 02/068589 
Figure 16 (cont.) 



90/116 



PCT/US02/05068 



SEQIDNO: 154 

Met Pro Set He Asn Ala Ser Asp Cys Lys Lys Lys Gly Asp Arg Ser Met Lys Arg Lys Lys Trp Thr 
Ala Leu Ala Leu Ser Leu Pro Leu Val Met Ser Leu Ser Thr Asn He Gin Ala Glu Thr Leu His Asn 
Asn Lys Gly Gin Lys Ala Gin Thr Gly Asn Lys Asp Gly He Phe Tyr Glu Leu Tyr Val Asn Ser Phe 
Tyr Asp Thr Asp Ser Asn Gly His Gly Asp Leu Lys Gly Val Thr Lys Lys Leu Asp Tyr Leu Asn 
Asp Gly Asn Pro Arg Thr Asn Asn Asp Leu Gin He Asn Gly He Trp Met Met Pro He Asn Thr Ser 
Pro Ser Tyr His Lys Tyr Asp Val Thr Asp Tyr Tyr Asn He Asp Pro Gin Tyr Gly Ser Leu Gin Asp 
Phe Arg Glu Leu Thr Thr Glu Ala His Lys Arg Asn Val Lys Val Val He Asp Leu Val He Asn His 
Thr Ser Ser Glu His Pro Trp Phe Val Asp Ala Leu Lys Asn Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr 
He Trp Ala Asp Lys Asn Thr Asp Leu Asn Glu Lys Gly Pro Tip Gly Gin Gin Val Trp His Lys Ala 
Ser Asn Gly Glu Tyr Phe Tyr Ala Thr Phe Trp Glu Gly Met Pro Asp Leu Asn Tyr Asp Asn Pro Lys 
Val Arg Glu Glu Met He Lys He Gly Lys Phe Trp Leu Lys Gin Gly Ala Asp Gly Phe Arg Leu Asp 
Ala Ala Met His He Phe Lys Gly Gin Thr Pro Glu Gly Ala Lys Lys Asn He Glu Tip Trp Asn Glu 
Phe Arg Asp Ala Met Arg Glu Thr Asn Pro Asn Thr Tyr Leu Val Gly Glu He Trp Asp Gin Pro Glu 
Val Val Ala Pro Tyr Tyr Gin Ser Leu Asp Ser Thr Phe Asn Phe Asp Leu Ala Tyr Lys He Val Asn 
Ser Val Lys Asn Gly Thr Asp Gin Gly Val Ala Ala Ala Ala Val Ala Thr Asp Glu Leu Tyr Lys Thr 
Tyr Asn Pro Asn Lys He Asp Gly Thr Phe Leu Thr Asn His Asp Gin Asn Arg Val Met Ser Glu Leu 
Asn Gly Asp Val Asn Lys Ala Lys Ser Ala Ala Ser He Leu Leu Thr Leu Pro Gly Asn Pro Phe He 
Tyr Tyr Gly Glu Glu He Gly Met Thr Gly Gin Lys Pro Asp Glu Leu He Arg Glu Pro Phe Arg Trp 
Tyr Glu Asp Asp Lys Glu Gly Gin Thr Ser Tip Glu Thr Pro Val Tyr Asn He Asp His Asn Gly Val 
Ser Val Glu Ala Gin Asp Lys Gin Lys Ala Ser Leu Leu Ser His Tyr Arg Lys Met He Arg Val Arg 
Gin Gin His Asp Glu Leu Val Lys Gly Asn Leu Glu Pro He Ser Val Asn Asn Ser Gin Val Val Ala 
Tyr Asn Arg Thr Tyr Lys Asn Lys Ser He Gin Val Tyr His Asn He Ser Asp Lys Pro Val Thr Leu 
Thr Val Ser Asn Lys Gly Lys Leu He Phe Ser Ser Glu Leu Gly Ala Lys Lys Glu Lys Ser Thr Leu 
Val He Pro Ala Asn Thr Thr Val Leu Val Lys 

SEQroNO: 155 

gtgtcaagaatgtttgcaaaacgattcaaaacctctttactgccgttattcgctggatttttattgctgtttcatttggttctggcaggaccaacggctg 

cgaatgctgaaacggctaacaaatcaaatgagcttacagcaccgtcgatcaaaagcggaaccattcttcatgcttggaattggtcgttcaatacgt 

taaaacacaatatgaaggatattcatgatgcaggatatacagcgattcagacgtctccgattaaccaagtcaaggaagggaaccaaggaaataa 

aaacatgtcgaactggtactggctctatcagccgacatcgtaccaaattggcaaccgttacttaggtactgaacaagaatttaaagaaatgtgtgc 

agccgctgaagaatatggcataaaggttattgttgacgcggtcatcaatcatacxaccagtgactatgccgcgatttccaatgagattaaga 

ccaaactggacacatggaaacacacaaattaaaaactggtctgatcgatgggatgtcacgcagaatgcattgctcgggctgtatgactggaata 

cacaaaatacacaagtacagtcctatttgaaacggttcttagaaagagcattgaatgacggggcagacggttttcgatttgatgccgccaaacata 

tagagcttccggatgatggcagttacggcagtcaattttggccgaatatcacaaatacatctgcagagttccaatacggagaaatcctgcaggat 

agtgcttcaagagatgcttcatatgcgaattatatgaatgtgacagcgtctaactatgggcattccataaggtccgctttaaagaatcgte 

cgtgtcgaatatctcccactatgcatcagatgtgtctgcggacaagctagtgacatgggtagaatcgcatgatacgtatgccaatgatgatgaag 

agtcgacatggatgagcgatgatgatatccgtttaggctgggcggtgatagcttctcgttcaggcagtacgcctcttttcttttccagacctg^ 

aggcggaaatggtgtgagattcccggggaaaagccaaataggcgatcgcgggagtgctttatttgaagatcaggctatcactgcggtcaatag 

atttcacaatgtgatggctggacagcctgaggaactctcgaacccaaatggaaacaaccagatatttatgaatcagcgcggctcacatggcgttg 

tgctggcaaatgcaggttcatcctctgtttctatcaatacgccaacaaaattgcctgatggcaggtatgataataaagctggggcaggttcatttca 

agtaaatgacggtaaactgacaggcacgatcaatgccaggtctgtggctgtgctttatcctgatgatattgcaaaagcgcctcatgttttccttgag 

aattacaaaacaggtgtaacacattctttcaatgatcaactgacgattacactgcgtgcagatgcgaatacaacaaaagccgtttatcaaatcaata 

atggaccagagacggcgtttaaggatggagatcaattcacaatcggaaaaggagatccatttggcaaaacatacaccatcatgttaaaaggaac 

gaacagtgatggtgtaacgaggaccgaggaatacagttttgttaaaagagatccagcttcggccaaaaccatcggctatcaaaatccgaatcatt 

ggagccaggtaaatgcttatatctataaacatgatgggggccgggca 



SEQIDNO: 156 
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Val Ser Arg Met Phe Ala Lys Arg Phe Lys Thr Set Leu Leu Pro Leu Phe Ala Gly Phe Leu Leu Leu 
Phe His Leu Val Leu Ala Gly Pro Thr Ala Ala Asn Ala Glu Thr Ala Asn Lys Ser Asn Glu Leu Thr 
Ala Pro Ser He Lys Ser Gly Thr He Leu His Ala Trp Asn Trp Ser Phe Asn Thr Leu Lys His Asn 
Met Lys Asp He His Asp Ala Gly Tyr Thr Ala He Gin Thr Ser Pro He Asn Gin Val Lys Glu Gly 
Asn Gin Gly Asn Lys Asn Met Ser Asn Trp Tyr Trp Leu Tyr Gin Pro Thr Ser Tyr Gin He Gly Asn 
Arg Tyr Leu Gly Thr Glu Gin Glu Phe Lys Glu Met Cys Ala Ala Ala Glu Glu Tyr Gly He Lys Val 
He Val Asp Ala Val He Asn His Thr Thr Ser Asp Tyr Ala Ala He Ser Asn Glu He Lys Ser He Pro 
Asn Trp Thr His Gly Asn Thr Gin He Lys Asn Trp Ser Asp Arg Trp Asp Val Thr Gin Asn Ala Leu 
Leu Gly Leu Tyr Asp Trp Asn Thr Gin Asn Thr Gin Val Gin Ser Tyr Leu Lys Arg Phe Leu Glu 
Arg Ala Leu Asn Asp Gly Ala Asp Gly Phe Arg Phe Asp Ala Ala Lys His He Glu Leu Pro Asp Asp 
Gly Ser Tyr Gly Ser Gin Phe Trp Pro Asn He Thr Asn Thr Ser Ala Glu Phe Gin Tyr Gly Glu He 
Leu Gin Asp Ser Ala Ser Arg Asp Ala Ser Tyr Ala Asn Tyr Met Asn Val Thr Ala Ser Asn Tyr Gly 
His Ser He Arg Ser Ala Leu Lys Asn Arg Asn Leu Gly Val Ser Asn He Ser His Tyr Ala Ser Asp 
Val Ser Ala Asp Lys Leu Val Thr Trp Val Glu Ser His Asp Thr Tyr Ala Asn Asp Asp Glu Glu Ser 
Thr Trp Met Ser Asp Asp Asp He Arg Leu Gly Trp Ala Val He Ala Ser Arg Ser Gly Ser Thr Pro 
Leu Phe Phe Ser Arg Pro Glu Gly Gly Gly Asn Gly Val Arg Phe Pro Gly Lys Ser Gin He Gly Asp 
Arg Gly Ser Ala Leu Phe Glu Asp Gta Ala He Thr Ala Val Asn Arg Phe His Asn Val Met Ala Gly 
Gin Pro Glu Glu Leu Ser Asn Pro Asn Gly Asn Asn Gin He Phe Met Asn Gin Arg Gly Ser His Gly 
Val Val Leu Ala Asn Ala Gly Ser Ser Ser Val Ser He Asn Thr Pro Thr Lys Leu Pro Asp Gly Arg 
Tyr Asp Asn Lys Ala Gly Ala Gly Ser Phe Ghi Val Asn Asp Gly Lys Leu Thr Gly Thr He Asn Ala 
Arg Ser Val Ala Val Leu Tyr Pro Asp Asp He Ala Lys Ala Pro His Val Phe Leu Glu Asn Tyr Lys 
Thr Gly Val Thr His Ser Phe Asn Asp Gin Leu Thr He Thr Leu Arg Ala Asp Ala Asn Thr Thr Lys 
Ala Val Tyr Gin He Asn Asn Gly Pro Glu Thr Ala Phe Lys Asp Gly Asp Gin Phe Thr He Gly Lys 
Gly Asp Pro Phe Gly Lys Thr Tyr Thr He Met Leu Lys Gly Thr Asn Ser Asp Gly Val Thr Arg Thr 
Glu Glu Tyr Ser Phe Val Lys Arg Asp Pro Ala Ser Ala Lys Thr He Gly Tyr Gin Asn Pro Asn His 
Trp Ser Gin Val Asn Ala Tyr He Tyr Lys His Asp Gly Gly Arg Ala 

SEQIDNO: 157 

atgcaaacgattgcaaaaaaaggggatgaaacgatgaaagggaaaaaatggacagcattagctctaacactgccgctggctgctagcttatca 

acaggcgttcacgccgaaaccgtacataaaggtaaagctccaacagcagataaaaacggtgtcttttatgaggtgtatgtaaactctttttacg 

gcaaataaagatggacatggtgatttaaaaggtcttacacaaaagctggattatttgaatgacggcaattctcataccaaaaatgatcttcaagtaa 

acggaatttggatgatgccggtaaacccttctcctagctatcataaatatgatgtaacggactattataacattgatccgcagtacggaaatctgca 

agattttcgcaagctgatgaaagaagcagataaacgagacgtaaaggttattatggacctcgttgtgaatcatacaagcagtgaacatcctt^ 

tcaagctgcattaaaagataaaaacagcaagtacagagattactatatttgggccgataaaaatactgatttaaatgaaaaaggatcttgggggca 

gcaagtatggcataaagctccaaacggagagtatttttatggtacgttttgggaaggaatgcctgacttaaattacgataatcccgaagtaagaaa 

agaaatgattaacgtcgggaaattttggctaaagcaaggcgttgacgggttccgcttagatgctgcgcttcatatttttaaaggtcaaacacctgaa 

ggcgctaagaaaaatatcgtgtggtggaatgagtttagagatgcaatgaaaaaagaaaaccctaacgtatatctaacgggtgaagtatgggatc 

aaccggaagtagtagctccttactatcaatcgcttgattctttatttaactttgatttagcaggaaagattgtaaactctgtaaaatcaggaaatgatca 

aggaatcgcgactgcagcagccgcaactgatgagctgttcaaatcatacaatccaaataaaattgacggcattttcttaaccaaccatgaccaaa 

atcgcgtcatgagtgagctaagcggcgatgtgaataaagcaaagtcagctgcctctatcttacttacgcttcctggcaacccgtatatttattacgg 

tgaagaaattggaatgaccggtgaaaagcctgatgagttaatccgtgaaccgttccgctggtacgaaggcaatggacttggacaaaccagctg 

ggaaacatccgtatacaacaaaggcggcaatggtgtgtcagtagagacacaaacaaaacaaaaggattctttgttaaatcattaccgtgaaatga 

ttcgcgtgcgtcagcagcatgaagagttagtaaaaggaacccttcaatctatttcagtagacagtaaagaagtcgttgcctatagccgcacgtata 

aaggcaaatcgattagcgtgtatcataatatttcaaatcaaccggtaaaagtatctgtaacagcgaaaggtaaattgatttttgctagtgaaaaaggt 

gcaaaaaaagtcaaaaatcagcttgtggttccagctaatacaacggttttaataaaataa 

SEQIDNO: 158 

Met Gin Thr He Ala Lys Lys Gly Asp Glu Thr Met Lys Gly Lys Lys Trp Thr Ala Leu Ala Leu Thr 
Leu Pro Leu Ala Ala Ser Leu Ser Thr Gly Val His Ala Glu Thr Val His Lys Gly Lys Ala Pro Thr 
Ala Asp Lys Asn Gly Val Phe Tyr Glu Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His 
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Gly Asp Leu Lys Gly Leu Thr Gin Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn 
Asp Leu Gin Val Asn Gly He Tip Met Met Pro Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val 
Thr Asp Tyr Tyr Asn He Asp Pro Gin Tyr Gly Asn Leu Gin Asp Phe Arg Lys Leu Met Lys Glu Ala 
Asp Lys Arg Asp Val Lys Val He Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Trp Phe 
Gin Ala Ala Leu Lys Asp Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp 
Leu Asn Glu Lys Gly Ser Trp Gly Gin Gin Val Trp His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly 
Thr Phe Trp Glu Gly Met Pro Asp Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met He Asn Val 
Gly Lys Phe Trp Leu Lys Gin Gly Val Asp Gly Phe Arg Leu Asp Ala Ala Leu His He Phe Lys Gly 
Gin Thr Pro Glu Gly Ala Lys Lys Asn He Val Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys Glu 
Asn Pro Asn Val Tyr Leu Thr Gly Glu Val Trp Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin Ser 
Leu Asp Ser Leu Phe Asn Phe Asp Leu Ala Gly Lys He Val Asn Ser Val Lys Ser Gly Asn Asp Gin 
Gly He Ala Thr Ala Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp Gly 
He Phe Leu Thr Asn His Asp Gin Asn Arg Val Met Ser Glu Leu Ser Gly Asp Val Asn Lys Ala Lys 
Ser Ala Ala Ser He Leu Leu Thr Leu Pro Gly Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met Thr 
Gly Glu Lys Pro Asp Glu Leu He Arg Glu Pro Phe Arg Trp Tyr Glu Gly Asn Gly Leu Gly Gin Thr 
Ser Trp Glu Thr Ser Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Thr Gin Thr Lys Gin Lys 
Asp Ser Leu Leu Asn His Tyr Arg Glu Met He Arg Val Arg Gin Gin His Glu Glu Leu Val Lys Gly 
Thr Leu Gin Ser He Ser Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Lys Ser He 
Ser Val Tyr His Asn He Ser Asn Gin Pro Val Lys Val Ser Val Thr Ala Lys Gly Lys Leu He Phe Ala 
Ser Glu Lys Gly Ala Lys Lys Val Lys Asn Gin Leu Val Val Pro Ala Asn Thr Thr Val Leu He Lys 

SEQIDNO: 159 

ttgcaaaaaaaaggggatgaaacgatgaaagggaaaaaatggacagctttagctctaacactgccgctggctgctagcttatcaacaggcgttc 

acgccgaaaccgtacataaaggtaaatctccaacagcagataaaaacggtgtattttatgaggtgtatgtaaactctttttacgatgcaaat^ 

tggacatggtgatttaaaaggtcttacacaaaagttggattatttaaatgatggcaattctcatacaaagaatgatcttcaagtaaacgggatttggat 

gatgccggtcaacccttctcccagctatcataaatatgatgtaacggactattataatattgatccgcagtatggaaatctgcaagattttcgcaaac 

tgatgaaagaagcagataaacgagatgtaaaagtcattatggacctcgttgtgaatcatacgagcagtgaacacccttggtttcaagctgcattaa 

aagataaaaacagcaagtacagagattactatatctgggctgataaaaataccgacttgaatgaaaaaggatcttggggacagcaagtatggca 

taaagctccaaacggagagtatttttacggaacgttttgggaaggaatgccggacttaaattacgataatcctgaagtaagaaaagaaatgattaa 

cgtaggaaagttttggctaaagcaaggagttgatgggttccgtctagatgctgcgcttcatatttttaaaggccaaacacctgaaggcgc 

aaatctcctgtggtggaatgaatttagagatgcaatgaaaaaggaaaaccctaacgtatatctaacgggtgaagtatgggatcaaccggaagta 

gtagctccttactatcaatcgcttgattctttatttaactttgatttagcaggaaagattgtaaactctgtaaaatcaggaaatgatcaaggaatcgcga 

ctgcagcagcggcaacggatgaactgttcaaatcatacaatccaaataaaattgacggtattttcttaaccaaccatgaccaaaatcgcgtcatga 

gtgagctaaacggcgatgtgaataaagcaaagtcagctgcctctatcttacttacgcttcctggcaacccgtatatttattacggtgaagaaatcgg 

catgaccggtgaaaagcctgatgagttaatccgtgaaccgttcccctggtacgaaggaaacggacttggacaaaccagctgggaaacacctgt 

atataacaaaggcggcaacggcgtgtctgtagaagcacaaacaaaacaaaaggactctttgttaaatcattaccgtgaaatgattcgcgtgcgtc 

agcagcacgaagagttagtaaaaggaacgcttcaatctatttcagtagacagtaaagaagtcgttgcctatagccgtacgtataaaggcaaatcg 

attagcgtgtatcataatatttcaaatcaaccggtaaaagtatctgtagcagcaaaaggtaaattgatttttgctagtgaaaaaggtgctaagaaagt 

caaaaatcagcttgtgattccggcgaatacaacggttttaataaaataa 

SEQroNO:160 

Met Gin Lys Lys Gly Asp Glu Thr Met Lys Gly Lys Lys Trp Thr Ala Leu Ala Leu Thr Leu Pro 
Leu Ala Ala Ser Leu Ser Thr Gly Val His Ala Glu Thr Val His Lys Gly Lys Ser Pro Thr Ala Asp 
Lys Asn Gly Val Phe Tyr Glu Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His Gly Asp 
Leu Lys Gly Leu Thr Gin Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn Asp Leu 
Gin Val Asn Gly He Trp Met Met Pro Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val Thr Asp 
Tyr Tyr Asn He Asp Pro Gin Tyr Gly Asn Leu Gin Asp Phe Arg Lys Leu Met Lys Glu Ala Asp 
Lys Arg Asp Val Lys Val He Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Tip Phe Gin 
Ala Ala Leu Lys Asp Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp Leu 
Asn Glu Lys Gly Ser Trp Gly Gin Gin Val Trp His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly Thr 
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Phe Tip Glu Gly Met Pro Asp Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met He Asn Val Gly 
Lys Phe Trp Leu Lys Gin Gly Val Asp Gly Phe Arg Leu Asp Ala Ala Leu His He Phe Lys Gly Gin 
Thr Pro Glu Gly Ala Lys Lys Asn Leu Leu Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys Glu 
Asn Pro Asn Val Tyr Leu Thr Gly Glu Val Trp Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin Ser 
Leu Asp Ser Leu Phe Asn Phe Asp Leu Ala Gly Lys He Val Asn Ser Val Lys Ser Gly Asn Asp Gin 
Gly He Ala Thr Ala Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp Gly 
He Phe Leu Thr Asn His Asp Gin Asn Arg Val Met Ser Glu Leu Asn Gly Asp Val Asn Lys Ala Lys 
Ser Ala Ala Ser He Leu Leu Thr Leu Pro Gly Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met Thr 
Gly Glu Lys Pro Asp Glu Leu He Arg Glu Pro Phe Pro Trp Tyr Glu Gly Asn Gly Leu Gly Gin Thr 
Ser Trp Glu Thr Pro Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Ala Gin Thr Lys Gin Lys 
Asp Ser Leu Leu Asn His Tyr Arg Glu Met He Arg Val Arg Gin Gin His Glu Glu Leu Val Lys Gly 
Thr Leu Gin Ser He Ser Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Lys Ser He 
Ser Val Tyr His Asn He Ser Asn Gin Pro Val Lys Val Ser Val Ala Ala Lys Gly Lys Leu He Phe Ala 
Ser Glu Lys Gly Ala Lys Lys Val Lys Asn Gin Leu Val He Pro Ala Asn Thr Thr Val Leu He Lys 

SEQIDNO: 161 

gtggatccaaagaattgtagtcaatttatgcaaacgattgcaaaaaaaggggatgaaacgatgaaagggaaaaaatggacagctttagctctaa 

cactgccgctggctgctagcttatcaacaggtgttcacgccgaaaccgtacataaaggtaaagctccaacagcagataaaaacggtgtcttttat 

gaggtatatgtaaactctttttacgatgcaaataaagatggacatggtgatttaaaaggccttacacaaaagttggactatttaaatgacggaaattc 

tcatacaaagaatgatcttcaagtaaacgggatttggatgatgccggtcaacccttctcctagctatcataaatatgatgtaacggactattataatat 

tgatccgcagtatggaaatctgcaagattttcgcaaacttatgaaagaagcagataaacgagacgtaaaagtcattatggaccttgttgtgaatcat 

acgagcagtgaacacccttggtttcaagctgcgttgaaagataaaaacagcaagtacagagattactatatttgggctgataaaaatactgacttg 

aatgaaaaaggatcttggggacaacaagtatggcataaagctccaaacggagagtatttttacggaacgttctgggaaggaatgcctgacttaa 

attacgataaccctgaagtaagaaaagaaatgattaacgtcggaaagttttggctaaaacaaggcgttgacggcttccgcttagatgctgcccttc 

atatttttaaaggtcaaacgcctgaaggcgctaagaaaaacattctatggtggaatgagtttagagatgcgatgaaaaaagaaaacccgaacgta 

tatctaacgggtgaagtgtgggaccagccagaagtagtagccccttactatcaatcacttgattctctatttaattttgatttagcaggaaaaattgtc 

agctctgtaaaagcaggaaatgatcaaggaatcgccactgcagcagcggcaactgatgagctgttcaaatcatacaatccaaataaaattgacg 

gcattttcttaaccaaccatgaccaaaatcgcgtcatgagtgagttaagcggcgatgtgaataaagcaaaatcagccgcctctatcttacttacgct 

tcctggaaatccgtatatttattacggtgaagaaattggcatgacaggtgaaaagcctgatgaattaatccgtgaaccgttccgctggtacgaagg 



ccttgttaaatcattaccgtgaaatgattcgtgtgcgccagcagcacgaagagttagtaaaaggaacgcttcaatccatttcagtagacagtaaag 
aagtcgttgcctatagccgcacgtacaaaggcaaatcgattagcgtgtatcataatatttcaaatcaacctgtaaaagtatctgtagcagcgaaag 
gtaacttgatttttgctagtgaaaaaggtgctaagaaagtcaaaaatcagcttgtgattccggcgaatgcgacggttttaataaaataa 

SEQIDNO: 162 

Val Asp Pro Lys Asn Cys Ser Gin Phe Met Gin Thr He Ala Lys Lys Gly Asp Glu Thr Met Lys Gly 
Lys Lys Trp Thr Ala Leu Ala Leu Thr Leu Pro Leu Ala Ala Ser Leu Ser Thr Gly Val His Ala Glu 
Thr Val His Lys Gly Lys Ala Pro Thr Ala Asp Lys Asn Gly Val Phe Tyr Glu Val Tyr Val Asn Ser 
Phe Tyr Asp Ala Asn Lys Asp Gly His Gly Asp Leu Lys Gly Leu Thr Gin Lys Leu Asp Tyr Leu 
Asn Asp Gly Asn Ser His Thr Lys Asn Asp Leu Gin Val Asn Gly He Trp Met Met Pro Val Asn Pro 
Ser Pro Ser Tyr His Lys Tyr Asp Val Thr Asp Tyr Tyr Asn He Asp Pro Ghi Tyr Gly Asn Leu Gin 
Asp Phe Arg Lys Leu Met Lys Glu Ala Asp Lys Arg Asp Val Lys Val He Met Asp Leu Val Val 
Asn His Thr Ser Ser Glu His Pro Trp Phe Gin Ala Ala Leu Lys Asp Lys Asn Ser Lys Tyr Arg Asp 
Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp Leu Asn Glu Lys Gly Ser Tip Gly Gin Gin Val Tip His 
Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly Thr Phe Trp Glu Gly Met Pro Asp Leu Asn Tyr Asp 
Asn Pro Glu Val Arg Lys Glu Met He Asn Val Gly Lys Phe Tip Leu Lys Gin Gly Val Asp Gly Phe 
Arg Leu Asp Ala Ala Leu His He Phe Lys Gly Gin Thr Pro Glu Gly Ala Lys Lys Asn He Leu Trp 
Trp Asn Glu Phe Arg Asp Ala Met Lys Lys Glu Asn Pro Asn Val Tyr Leu Thr Gly Glu Val Trp 
Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin Ser Leu Asp Ser Leu Phe Asn Phe Asp Leu Ala Gly 
Lys He Val Ser Ser Val Lys Ala Gly Asn Asp Gin Gly He Ala Thr Ala Ala Ala Ala Thr Asp Glu 
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Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp Gly He Phe Leu Thr Asn His Asp Gin Asn Arg Val 
Met Ser Glu Leu Ser Gly Asp Val Asn Lys Ala Lys Ser Ala Ala Ser He Leu Leu Thr Leu Pro Gly 
Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met Thr Gly Glu Lys Pro Asp Glu Leu He Arg Glu 
Pro Phe Arg Trp Tyr Glu Gly Asn Gly He Gly Gin Thr Ser Trp Glu Thr Pro Val Tyr Asn Lys Gly 
Gly Asn Gly Val Ser Val Glu Ala Gin Thr Lys Gin Lys Asp Ser Leu Uu Asn His Tyr Arg Glu Met 
He Arg Val Arg Gin Gin His Glu Glu Leu Val Lys Gly Thr Leu Gin Ser He Ser Val Asp Ser Lys 
Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Lys Ser He Ser Val Tyr His Asn He Ser Asn Gin Pro 
Val Lys Val Ser Val Ala Ala Lys Gly Asn Leu He Phe Ala Ser Glu Lys Gly Ala Lys Lys Val Lys 
Asn Gin Leu Val He Pro Ala Asn Ala Thr Val Leu He Lys 

SEQIDNO: 163 

atggtacgtcccgaacgacgggctgcattggaaccgactatcgaacgactcgcagcacttgaaagacattgggtgacgacggtgtggattccg 

ccggcgtacaaaggcacgtcacagaacgatgtcgggtatggggcgtacgatttatacgatctcggcgaattcaaccaaaaagggacgacccg 

gacgaagtacgggacgaaagcgcagctccagaccgccatctcgaacttgcgcggtaaagggatcggtgtgtacggcgacgtcgtcatgaat 

cacaagggcggggccgattataccgaatccgttcaggcgatcgaggtcaatccgtcgaaccggaaccaagaaacgtccggtgagtatggcat 

ctcggcctggactgggttcaacttcgcggggcgcaacaatacatactcgccgttcaaatggcgctggtaccattttgacggtaccgattgggatc 

agtcacgcagcttgagccgcatctataagttcaagagcacaggcaaggcgtgggacacggacgtgtcgaacgagaacggcaactatgattat 

cttatgtatgccgacgtcgatttcgagcatcccgaggtccgccaagagatgaagaactggggcaaatggtacgccgactcgctcgggctcgac 

ggtttccggttggatgcggtcaaacatatcagccactcgtaottgaaggagtgggtgacgagcgtgcgccagacgaccgggaaagagatgttc 

acggtcgccgagtattggaagaacgatctcggtgccatcaacgactatctgtataiagacgggctacacgcactccgtcttcgatgtgccgctcc 

attataacttccaagcggccggtaacggcggcgggtattacgatatgcgcaacatcttgaaaggcaccgtcaccgaacagcatccgtcgctgtc 

cgtgacgattgtcgataaccacgactcacagccgggccagtcgctcgagtcgacggtcgccaactggttcaaaccgctcgcctacgcgacga 

tcatgacgcgcggtcagggttatccggccctcttctatggagactattatggcacgaaagggacgacgaaccgcgaaatcccgaacatgtcgg 

gcacgctccaaccgattttgaaggcacgaaaagacttcgcctacgggacgcagcatgactacctcgatcatcaggacgtcatcggctggacac 

gtgaaggtgtgaccgaccgtgccaaatcgggtctcgcgacgattctatcggacggtccgggcggctcgaagtggatgtacgtcggcaaacag 

aacgccggcgaggtatggaaagacatgacgaacaacaacgcccgtctcgtcacgatcaatgctgacggctggggtcagttcttcgtcaacgg 

aggctcggtctcgatttatacgcaacaataa 

SEQIDNO: 164 

Met Val Arg Pro Glu Arg Arg Ala Ala Leu Glu Pro Thr He Glu Arg Leu Ala Ala Leu Glu Arg His 
Trp Val Thr Thr Val Trp He Pro Pro Ala Tyr Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala 
Tyr Asp Leu Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Thr Arg Thr Lys Tyr Gly Thr Lys 
Ala Gin Leu Gin Thr Ala He Ser Asn Leu Arg Gly Lys Gly He Gly Val Tyr Gly Asp Val Val Met 
Asn His Lys Gly Gly Ala Asp Tyr Thr Glu Ser Val Gin Ala He Glu Val Asn Pro Ser Asn Arg Asn 
Gin Glu Thr Ser Gly Glu Tyr Gly He Ser Ala Trp Thr Gly Phe Asn Phe Ala Gly Arg Asn Asn Thr 
Tyr Ser Pro Phe Lys Trp Arg Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Ser Leu Ser 
Arg He Tyr Lys Phe Lys Ser Thr Gly Lys Ala Trp Asp Thr Asp Val Ser Asn Glu Asn Gly Asn Tyr 
Asp Tyr Leu Met Tyr Ala Asp Val Asp Phe Glu His Pro Glu Val Arg Gin Glu Met Lys Asn Trp 
Gly Lys Trp Tyr Ala Asp Ser Leu Gly Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His He Ser His 
Ser Tyr Leu Lys Glu Trp Val Thr Ser Val Arg Gin Thr Thr Gly Lys Glu Met Phe Thr Val Ala Glu 
Tyr Trp Lys Asn Asp Leu Gly Ala He Asn Asp Tyr Leu Tyr Lys Thr Gly Tyr Thr His Ser Val Phe 
Asp Val Pro Leu His Tyr Asn Phe Gin Ala Ala Gly Asn Gly Gly Gly Tyr Tyr Asp Met Arg Asn He 
Leu Lys Gly Thr Val Thr Glu Gin His Pro Ser Leu Ser Val Thr He Val Asp Asn His Asp Ser Gin 
Pro Gly Gin Ser Leu Glu Ser Thr Val Ala Asn Trp Phe Lys Pro Leu Ala Tyr Ala Thr He Met Thr 
Arg Gly Gin Gly Tyr Pro Ala Leu Phe Tyr Gly Asp Tyr Tyr Gly Thr Lys Gly Thr Thr Asn Arg Glu 
He Pro Asn Met Ser Gly Thr Leu Gbi Pro lie Leu Lys Ala Arg Lys Asp Phe Ala Tyr Gly Thr Gin 
His Asp Tyr Leu Asp His Gin Asp Val He Gly Tip Thr Arg Glu Gly Val Thr Asp Arg Ala Lys Ser 
Gly Leu Ala Thr He Leu Ser Asp Gly Pro Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gin Asn Ala 
Gly Glu Val Trp Lys Asp Met Thr Asn Asn Asn Ala Arg Leu Val Thr He Asn Ala Asp Gly Trp Gly 
Gin Phe Phe Val Asn Gly Gly Ser Val Ser lie Tyr Thr Gin Gin 
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SEQIDNO: 165 

atgcagtatttcgagtggtacgtgccaaatgatggggaacaUggaatcgtUgcgtaatgatgctgaaaatttagctca 

atggataccacccgtatataaaggaacttcacaaaatgatgtagggtatggagtgtatgatgtatatgatttgggagaattcaatcaaaaaggaac 

gatacggacaaaatatgggacaaaagcacaattaaaatctgcaangaggctttacataatcaaaatatcgatgtatacggtgatgttg^ 

cataaaggtggggcagattatactgaggttgtaacagccgttgaggtagaccgtaacaatcgaaatangaaacatcga^^^ 

gcgtggacgggamgatmccaggacgcagggactcctattctaattttaaatggagatggmcattttgatggaacagattgggatgaggga 

ggaaattaaatagaatttataaatttaaaggcgtaggtaaagcttgggactgggaagtgtctagtgagaatggtaactat^^ 

gatcttgamcgatcatcctgaagttgcaaatgaaatgaaaaactggggaacctggtatgcggacgaattaaatttagat^^^ 

cagttaaacatattgaccatgagtatcttcgtgattgggtaaatcatgttagaaagcaaacggggaaggaaatgmacagtagctgaatatt^^ 

aaatgataUcgtactttaaacaattatttagggaaagtaaattataatcaatctgtgttcgatgcacctcttcattataatm^ 

gaatggaaattatgautgaggaatattttaaagggtacggtagtagaaagtcatcctacacttgctgttactcttgtt^ 

tggacagtcattagaatctgrtgtgaglccttggtttaagccgttggcctatgcatttatmaacgcgjgcagaagggtatccttct 

attactatggcacaaatggaaatagtagttatgaaanccaacgttaaaggaUaaaUgatccaaUctgacggcacgaaaaaacmgcate^^ 

tacgcaacatgattatttagaccatccagatgtgaUggctggacaagagaaggggatagtatacatgctaattctggt^ 

atggaccaggaggatcaaaatggatgaatgttggaaagaacaacgcaggggaaatatggtacgatattacgggcaatcaaacaaatactgtaa 

cgattaataaagatggatgggggcagttccatgtaaatgggggctctgtttcaatatatgttcagaagtaa 

SEQIDNO: 166 

Met Gin Tyr Phe Glu Trp Tyr Val Pro Asn Asp Gly Glu His Trp Asn Arg Leu Arg Asn Asp Ala 
Glu Asn Leu Ala His Lys Gly He Thr Ser Val Trp He Pro Pro Val Tyr Lys Gly Thr Ser Gin Asn 
Asp Val Gly Tyr Gly Val Tyr Asp Val Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr He Arg Thr 
Lys Tyr Gly Thr Lys Ala Gin Leu Lys Ser Ala He Glu Ala Leu His Asn Gin Asn He Asp Val Tyr 
Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp Tyr Thr Glu Val Val Thr Ala Val Glu Val Asp 
Arg Asn Asn Arg Asn He Glu Thr Ser Ser Asp Tyr Gin He Asp Ala Trp Thr Gly Phe Asp Phe Pro 
Gly Arg Arg Asp Ser Tyr Ser Asn Phe Lys Trp Arg Trp Phe His Phe Asp Gly Thr Asp Trp Asp Glu 
Gly Arg Lys Leu Asn Arg He Tyr Lys Phe Lys Gly Val Gly Lys Ala Trp Asp Trp Glu Val Ser Ser 
Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Phe Asp His Pro Glu Val Ala Asn 
Glu Met Lys Asn Trp Gly Thr Trp Tyr Ala Asp Glu Leu Asn Leu Asp Gly Phe Arg Leu Asp Ala 
Val Lys His He Asp His Glu Tyr Leu Arg Asp Trp Val Asn His Val Arg Lys Gin Thr Gly Lys Glu 
Met Phe Thr Val Ala Glu Tyr Trp Gin Asn Asp He Arg Thr Leu Asn Asn Tyr Leu Gly Lys Val Asn 
Tyr Asn Gin Ser Val Phe Asp Ala Pro Leu His Tyr Asn Phe His Tyr Ala Ser Thr Gly Asn Gly Asn 
Tyr Asp Met Arg Asn He Leu Lys Gly Thr Val Val Glu Ser His Pro Thr Leu Ala Val Thr Leu Val 
Glu Asn His Asp Ser Gin Pro Gly Gin Ser Leu Glu Ser Val Val Ser Pro Trp Phe Lys Pro Leu Ala 
Tyr Ala Phe He Leu Thr Arg Ala Glu Gly Tyr Pro Ser Val Phe Tyr Gly Asp Tyr Tyr Gly Thr Asn 
Gly Asn Ser Ser Tyr Glu He Pro Thr Leu Lys Asp Lys He Asp Pro He Leu Thr Ala Arg Lys Asn 
Phe Ala Tyr Gly Thr Gin His Asp Tyr Leu Asp His Pro Asp Val He Gly Trp Thr Arg Glu Gly Asp 
Ser He His Ala Asn Ser Gly Leu Ala Thr Leu He Ser Asp Gly Pro Gly Gly Ser Lys Trp Met Asn 
Val Gly Lys Asn Asn Ala Gly Glu He Trp Tyr Asp He Thr Gly Asn Gin Thr Asn Thr Val Thr He 
Asn Lys Asp Gly Trp Gly Gin Phe His Val Asn Gly Gly Ser Val Ser He Tyr Val Gin Lys 

SEQ ID NO: 167 

atgcaaacgattgcaaaaaaaggggatgaaacgatgaaagggaaaaaatggacagctttagctctaacactgccgctggctgctagcttatca 

acaggcgttcacgccgaaaccgtacataaaggtaaatctccaacagcagataaaaacggtgtattttatgaggtgtatgtaaactcttttta^ 

caaataaagatggacatggtgatttaaaaggtcttacacaaaagttggattatttaaatgatggcaattctcatacaaagaatgatcttcaagtaaac 

gggatttggatgatgccggtcaacccttctcccagctatcataaatatgatgtaacggactattataatattgatccgcagtatggaaatctgcaag 

attttcgcaaactgatgaaagaagcagataaacgagatgtaaaagtcattatggacctcgttgtgaatcatacgagcagtgaacacccttggtttc 

aagctgcattaaaagataaaaacagcaagtacagagattactatatctgggctgataaaaataccgacttgaatgaaaaaggatcttggggaca 

gcaagtatggcataaagccccaaacggagagtatttttacggaacgttttgggaaggaatgccggacttaaattacgataatcctgaagtaagaa 

aagaaatgattaacgtaggaaagttttggctaaagcaaggagttgacgggttccgtctagatgctgcgcttcatatttttaaaggccaaacacctg 
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aaggcgctaagaaaaatctcctgtggtggaatgaatttagagatgcaatgaaaaaggaaaaccctaacgtatatctaacgggtgaagtatgg^ 

tcaaccggaagtagtagctcctUctatcaatcgcUgaUctttatttaacmgatdagcaggaaagangtaaactctgtaaa 

caaggaatcgcgactgcagcagcggcaacggatgaactgUcaaatcatacaatccaaataaaangacggtatmcttaaccaaccatga^ 

aaatcgcgtcatgagtgagctaagcggcgatgtgaataaagcaaagtcagctgcctctatcttacttacgcttcctggcaacccgtata^ 

ggtgaagaaatcggcatgaccggtgaaaagcctgatgagttaatccgtgaaccgttccgctggtacgaaggaaacggacttggacaaaccag 

ctgggaaacacxtgtatacaacaaaggcggcaacggcgtgtctgtagaagcacaaacaaaacaaaaggactcmgUaaatcat^ 

atgattcgcgtgcgtcagcagcacgaagagttagtaaaaggaacgcttcaatctatttcagtagacagtaaagaagtcgttgcctatagccgcac 

gtataaaggcaaatcgattagcgtgtatcataatamcaaatcaaccggtaaaagtatctgtagcagcaaaaggtaaattgatm 

aaggtgctaagaaagtcaaaaatcagcttgtgattccggcgaatacaacggttttaataaaataa 

SEQ ID NO: 168 

Met Gin Thr He Ala Lys Lys Gly Asp Glu Thr Met Lys Gly Lys Lys Tip Thr Ala Leu Ala Leu Thr 
Leu Pro Leu Ala Ala Ser Leu Ser Thr Gly Val His Ala Glu Thr Val His Lys Gly Lys Ser Pro Thr 
Ala Asp Lys Asn Gly Val Phe Tyr Glu Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His 
Gly Asp Leu Lys Gly Leu Thr Gin Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn 
Asp Leu Gin Val Asn Gly He Trp Met Met Pro Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val 
Thr Asp Tyr Tyr Asn He Asp Pro Gin Tyr Gly Asn Leu Gin Asp Phe Arg Lys Leu Met Lys Glu Ala 
Asp Lys Arg Asp Val Lys Val lie Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Tip Phe 
Gin Ala Ala Leu Lys Asp Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp 
Leu Asn Glu Lys Gly Ser Tip Gly Gin Gin Val Trp His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly 
Thr Phe Trp Glu Gly Met Pro Asp Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met He Asn Val 
Gly Lys Phe Tip Leu Lys Gin Gly Val Asp Gly Phe Arg Leu Asp Ala Ala Leu His He Phe Lys Gly 
Gin Thr Pro Glu Gly Ala Lys Lys Asn Leu Leu Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys 
Glu Asn Pro Asn Val Tyr Leu Thr Gly Glu Val Trp Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin 
Ser Leu Asp Ser Leu Phe Asn Phe Asp Leu Ala Gly Lys He Val Asn Ser Val Lys Ser Gly Asn Asp 
Gin Gly He Ala Thr Ala Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp 
Gly He Phe Leu Thr Asn His Asp Gin Asn Arg Val Met Ser Glu Leu Ser Gly Asp Val Asn Lys Ala 
Lys Ser Ala Ala Ser He Leu-Leu Thr Leu Pro Gly Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met 
Thr Gly Glu Lys Pro Asp Glu Leu He Arg Glu Pro Phe Arg Trp Tyr Glu Gly Asn Gly Leu Gly Gin 
Thr Ser Trp Glu Thr Pro Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Ala Gin Thr Lys Gin 
Lys Asp Ser Leu Leu Asn His Tyr Arg Glu Met He Arg Val Arg Gin Gin His Glu Glu Leu Val Lys 
Gly Thr Leu Gin Ser He Ser Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Lys 
Ser He Ser Val Tyr His Asn He Ser Asn Gin Pro Val Lys Val Ser Val Ala Ala Lys Gly Lys Leu He 
Phe Gly Ser Glu Lys Gly Ala Lys Lys Val Lys Asn Gin Leu Val He Pro Ala Asn Thr Thr Val Leu 
He Lys 

SEQ ID NO: 169 

atgaaaacattcaaattaaaacgcacttttttaccgctaaccttgctgctcagtgctcctgcctttgctgggcaaaatggcaccatgatgcagtatttt 

cattggtatgtacctaatgatggcgcattatggacgcaggttgaaagcaatgctccagcactcgctgaaaacggttttacagcgctctggctacc 

gccagcttacaaaggcgcgggcggcagtaatgacgtcggttatggcgtctatgatatgtacgatttaggtgagtttgatcaaaaaggctcagtac 

gaaccaaatacggcaccaaggctcagtacatctctgcaatcaatgccgcgcacaacaacaatatccaaatctacggcgatgttgtgtttaaccac 

cgaggtggtgctgatgggaagtcgtgggtcgataccaagcgcgttgattgggacaaccgtaacattgaactgggcgacaaatggattgaagct 

tgggttgagtttaattttcctggccgcaacgacaaatactcaaacttccattggacttggtatcactttgacggtgttgactgggatgatgccggcaa 

agaaaaagcgatctttaaattcaaaggcgaaggaaaagcatgggattgggaagtcagctctgaaaaaggcaattacgactacctaatgtacgc 

cgatttagacatggatcaccaagaagttaaacaagagctgaaagattggggtgagtggtacatcaacatgaccggcgttgatggctttagaatg 

gatgccgtgaagcacattaaatatcagtatctacaagagtggattgatcatttacgttggaaaacaggcaaagagcttttcaccgttggtgagtatt 

ggaattacgacgtaaatcaactgcataactttattactaagacctctggcagtatgtcgttgttcgatgcgccgcttcacatgaacttctacaacgcg 

tcaaaatctggcggcaattacgatatgcgccaaatcatgaatggcacgttgatgaaggacaacccagtcaaagctgtgactctcgtagaaaacc 

acgatacacagccattgcaggcgttagagtcgacagtggattggtggttcaagcctcttgcttacgcattcattttattgcgtgaagaaggttatcc 

atcagtgttctacgcagattactacggcgcgcagtacagcgacaaaggctacaacatcaatatggccaaagttccttacattgaagaacttgtaa 
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cactgcgtaaagagtatgcgtatggcaaacagaattcttatctcgaccactgggatgtgattggctggacccgagagggcgatgctgaacatcc 
aaactcaatggcggtgatcatgagtgatggaccaggtggcaaaaaatggatgtataccggtaagccaagcacgcgctatgtcgacaagctgg 
gtatccgaactgaagaagmggaccgataccaatggctgggcagaamcctgtcaatggtggttcagtctcggtttgggtgggc^^ 

SEQIDNO:170 

Met Lys Thr Phe Lys Leu Lys Arg Thr Phe Leu Pro Leu Thr Leu Leu Leu Ser Ala Pro Ala Phe Ala 
Gly Gin Asn Gly Thr Met Met Gin Tyr Phe His Trp Tyr Val Pro Asn Asp Gly Ala Leu Trp Thr Gin 
Val Glu Ser Asn Ala Pro Ala Leu Ala Glu Asn Gly Phe Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys 
Gly Ala Gly Gly Ser Asn Asp Val Gly Tyr Gly Val Tyr Asp Met Tyr Asp Leu Gly Glu Phe Asp 
Gin Lys Gly Ser Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Tyr He Ser Ala He Asn Ala Ala His 
Asn Asn Asn He Gin He Tyr Gly Asp Val Val Phe Asn His Arg Gly Gly Ala Asp Gly Lys Ser Tip 
Val Asp Thr Lys Arg Val Asp Trp Asp Asn Arg Asn He Glu Leu Gly Asp Lys Trp He Glu Ala Trp 
Val Glu Phe Asn Phe Pro Gly Arg Asn Asp Lys Tyr Ser Asn Phe His Trp Thr Trp Tyr His Phe Asp 
Gly Val Asp Trp Asp Asp Ala Gly Lys Glu Lys Ala He Phe Lys Phe Lys Gly Glu Gly Lys Ala Trp 
Asp Trp Glu Val Ser Ser Glu Lys Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp 
His Gin Glu Val Lys Gin Glu Leu Lys Asp Trp Gly Glu Trp Tyr He Asn Met Thr Gly Val Asp Gly 
Phe Arg Met Asp Ala Val Lys His He Lys Tyr Gin Tyr Leu Gin Glu Trp He Asp His Leu Arg Trp 
Lys Thr Gly Lys Glu Leu Phe Thr Val Gly Glu Tyr Trp Asn Tyr Asp Val Asn GUi Leu His Asn 
Phe He Thr Lys Thr Ser Gly Ser Met Ser Leu Phe Asp Ala Pro Leu His Met Asn Phe Tyr Asn Ala 
Ser Lys Ser Gly Gly Asn Tyr Asp Met Arg Gin He Met Asn Gly Thr Leu Met Lys Asp Asn Pro Val 
Lys Ala Val Thr Leu Val Glu Asn His Asp Thr Gin Pro Leu Gin Ala Leu Glu Ser Thr Val Asp Tip 
Tip Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Leu Arg Glu Glu Gly Tyr Pro Ser Val Phe Tyr Ala 
Asp Tyr Tyr Gly Ala Gin Tyr Ser Asp Lys Gly Tyr Asn He Asn Met Ala Lys Val Pro Tyr He Glu 
Glu Leu Val Thr Leu Arg Lys Glu Tyr Ala Tyr Gly Lys Gin Asn Ser Tyr Leu Asp His Trp Asp Val 
He Gly Trp Thr Arg Glu Gly Asp Ala Glu His Pro Asn Ser Met Ala Val He Met Ser Asp Gly Pro 
Gly Gly Lys Lys Trp Met Tyr Thr Gly Lys Pro Ser Thr Arg Tyr Val Asp Lys Leu Gly He Arg Thr 
Glu Glu Val Trp Thr Asp Thr Asn Gly Trp Ala Glu Phe Pro Val Asn Gly Gly Ser Val Ser Val Tip 
Val Gly Val Lys 

SEQBDNO: 171 

gtgtatgtaaactctttttacgatgcaaataaagatggacatggtgatttaaaaggtcttacacaaaagttggattatttaaatgatggcaattctcata 

caaagaatgatcttcaagtaaacgggatttggatgatgccggtcaacccttctcccagctatcataaatatgatgtaacggactattataatattgat 

ccgcagtatggaaatctgcaagattttcgcaaactgatgaaagaagcagataaacgagatgtaaaagtcattatggacctcgttgtgaatcatac 

gagcagtgaacacccttggtttcaagctgcattaaaagataaaaacagcaagtacagagattactatatctgggctgataaaaataccgacttga 

atgaaaaaggatcttggggacagcaagtatggcataaagccccaaacggagagtatttttacggaacgttttgggaaggaatgccggacttaaa 

ttacgataatcctgaagtaagaaaagaaatgattaacgtaggaaagttttggctaaagcaaggagttgacgggttccgtctagatgctgcgcttca 

tatttttaaaggccaaacacctgaaggcgctaagaaaaatctcctgtggtggaatgaatttagagatgcaatgaaaaaggaaaaccctaacgtat 

atctaacgggtgaagtatgggatcaaccggaagtagtagctccttactatcaatcgcttgattctttatttaactttgatttagcaggaaagattgtaa 

actctgtaaaatcaggaaatgatcaaggaatcgcgactgcagcagcggcaacggatgaactgttcaaatcatacaatccaaataaaattgacgg 

tattttcttaaccaaccatgaccaaaatcgcgtcatgagtgagctaagcggcgatgtgaataaagcaaagtcagctgcctctatcttacttacgctt 

cctggcaacccgtatatttattacggtgaagaaatcggcatgaccggtgaaaagcctgatgagttaatccgtgaaccgttccgctggtacgaagg 

aaacggacttggacaaaccagctgggaaacacctgtatacaacaaaggcggcaacggcgtgtctgtagaagcacaaacaaaacaaaaggac 

tctttgttaaatcattaccgtgaaatgattcgcgtgcgtcagcagcacgaagagttagtaaaaggaacgcttcaatctatttcagtagacagtaaag 

aagtcgttgcctatagccgcacgtataaaggcaaatcgattagcgtgtatcataatatttcaaatcaaccggtaaaagtatctgtag 

gtaaattgatttttggtagtgaaaaaggtgctaagaaagtcaaaaatcagcttgtgattccggcgaatacaacggttttaataaaataa 

SEQIDNO: 172 

Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His Gly Asp Leu Lys Gly Leu Thr Gin 
Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn Asp Leu Gin Val Asn Gly He Tip 
Met Met Pro Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val Thr Asp Tyr Tyr Asn He Asp Pro 
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Gin Tyr Gly Asn Leu Gin Asp Phe Arg Lys Leu Met Lys Glu Ala Asp Lys Arg Asp Val Lys Val 
He Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Trp Phe Gin Ala Ala Leu Lys Asp Lys 
Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn TTir Asp Leu Asn Glu Lys Gly Ser Trp 
Gly Gin Gin Val Tip His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly Thr Phe Trp Glu Gly Met Pro 
Asp Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met He Asn Val Gly Lys Phe Trp Leu Lys 
Gin Gly Val Asp Gly Phe Arg Leu Asp Ala Ala Leu His He Phe Lys Gly Gin Thr Pro Glu Gly Ala 
Lys Lys Asn Leu Leu Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys Glu Asn Pro Asn Val Tyr 
Leu Thr Gly Glu Val Trp Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin Ser Leu Asp Ser Leu Phe 
Asn Phe Asp Leu Ala Gly Lys He Val Asn Ser Val Lys Ser Gly Asn Asp Gin Gly He Ala Thr Ala 
Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp Gly He Phe Leu Thr Asn 
His Asp Gin Asn Arg Val Met Ser Glu Leu Ser Gly Asp Val Asn Lys Ala Lys Ser Ala Ala Ser He 
Leu Leu Thr Leu Pro Gly Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met Thr Gly Glu Lys Pro 
Asp Glu Leu He Arg Glu Pro Phe Arg Trp Tyr Glu Gly Asn Gly Leu Gly Gin Thr Ser Trp Glu Thr 
Pro Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Ala Gin Thr Lys Gin Lys Asp Ser Leu Leu 
Asn His Tyr Arg Glu Met He Arg Val Arg Gin Gin His Glu Glu Leu Val Lys Gly Thr Leu Gin Ser 
He Ser Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Lys Ser He Ser Val Tyr His 
Asn He Ser Asn Gin Pro Val Lys Val Ser Val Ala Ala Lys Gly Lys Leu He Phe Gly Ser Glu Lys 
Gly Ala Lys Lys Val Lys Asn Gin Leu Val He Pro Ala Asn Thr Thr Val Leu He Lys 

SEQIDNO: 173 

atgcaaacgattgcaaaaaaaggggatgaaacgatgaaagggaaaaaatggacagctttagctctaacactgccgctggctgctagcttatca 

acaggcgttcacgcagaaactgtacataaaggtaaagctccaacagcagataaaaacggtgttttttatgaggtgtatgtaaactctttttacgatg 

caaataaagatggacatggtgatttaaaaggtctgacacaaaagttggattatttaaatgacggcaattctcatacaaagaatgatcttcaagtaaa 

cgggatttggatgatgccggtaaaccxttctcctagctatcataaatatgatgtaacggactattataacattgatcctcagtacggaa 

gatttccgcaaactgatgaaagaagcagataaacgagacgtaaaagttattatggaccttgttgtgaatcatacgagcagtgaacacccttggttt 

caagctgcactaaaagataaaaacagcaagtacagagattactatatttgggctgataaaaataccgatttgaatgaaaaaggatcttggggaca 

gcaagtatggcataaagctccaaacggagagtatttttacggaacgttctgggaaggaatgcctgacttaaanacgataaccctgaa^ 

aagaaatgattaacgtcggaaagttttggctaaagcaaggcgttgatggcttccgcttagatgctgcccttcatatctttaaaggtcaaactcctga 

aggcgctaagaaaaatctcctgtggtggaatgagtttagagatgcaatgaaaaaagaaaaccctaacgtatatctaacgggtgaagtatgggat 

cagccggaagtagtagctccttattatcaatcgcttgattccctatttaactttgatttagcaggaaaaattgtcagctctgtaaaagcaggaaatgat 

caaggaatcgccactgcagcagcggcaacggatgagctgttcaaatcatacaatccaaataaaattgacggcattttcttaaccaaccatgacca 

aaaccgcgtcatgagtgagctaagcggagatgtgaataaagcaaaatcagctgcttctatcttacttacgcttcctggaaatccgtatatttattacg 

gtgaagaaattggcatgaccggtgaaaagcctgatgaattaatccgtgaaccgttccgctggtacgaaggcaacggaattggacaaactagct 

gggaaacacctgtatataacaaaggcggcaatggtgtgtctgtagaagcacaaaccaaacaaaaggattctttgttaaatcattaccgtgaaatg 

attcgcgtgcgtcagcagcacgaagagttagtaaaaggaacgcttcagtctatttcagtagacagtaaagaagttgtcgcttatagccgtacgta^ 

aaaggcaactccattagtgtgtatcataatatttcaaatcaacctgtaaaagtatctgtagcggcgaaaggtaaattgatttttgctagtgaaaaagg 

tgctaaaaaaggcaaaaatcagcttgtgattccggcgaatgcgacggttttaataaaataa 

SEQ ID NO: 174 

Met Gin Thr He Ala Lys Lys Gly Asp Glu Thr Met Lys Gly Lys Lys Trp Thr Ala Leu Ala Leu Thr 
Leu Pro Leu Ala Ala Ser Leu Ser Thr Gly Val His Ala Glu Thr Val His Lys Gly Lys Ala Pro Thr 
Ala Asp Lys Asn Gly Val Phe Tyr Glu Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His 
Gly Asp Leu Lys Gly Leu Thr Gin Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn 
Asp Leu Gin Val Asn Gly He Trp Met Met Pro Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val 
Thr Asp Tyr Tyr Asn He Asp Pro Gin Tyr Gly Ser Leu Gin Asp Phe Arg Lys Leu Met Lys Glu Ala 
Asp Lys Arg Asp Val Lys Val He Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Trp Phe 
Gin Ala Ala Leu Lys Asp Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp 
Leu Asn Glu Lys Gly Ser Trp Gly Gin Gin Val Trp His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly 
Thr Phe Trp Glu Gly Met Pro Asp Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met He Asn Val 
Gly Lys Phe Trp Leu Lys Gin Gly Val Asp Gly Phe Arg Leu Asp Ala Ala Leu His He Phe Lys Gly 
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Gin Thr Pro Glu Gly Ala Lys Lys Asn Leu Leu Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys 
Glu Asn Pro Asn Val Tyr Leu Thr Gly Glu Val Tip Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin 
Ser Leu Asp Set Leu Phe Asn Phe Asp Leu Ala Gly Lys lie Val Ser Ser Val Lys Ala Gly Asn Asp 
Gin Gly He Ala Thr Ala Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp 
Gly He Phe Leu Thr Asn His Asp Gin Asn Arg Val Met Ser Glu Leu Ser Gly Asp Val Asn Lys Ala 
Lys Ser Ala Ala Ser He Leu Leu Thr Leu Pro Gly Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met 
Thr Gly Glu Lys Pro Asp Glu Leu He Arg Glu Pro Phe Arg Trp Tyr Glu Gly Asn Gly He Gly Gin 
Thr Ser Tip Glu Thr Pro Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Ala Gin Thr Lys Gin 
Lys Asp Ser Leu Leu Asn His Tyr Arg Glu Met He Arg Val Arg Gin Gin His Glu Glu Leu Val Lys 
Gly Thr Leu Gin Ser He Ser Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Asn 
Ser He Ser Val Tyr His Asn He Ser Asn Gin Pro Val Lys Val Ser Val Ala Ala Lys Gly Lys Leu He 
Phe Ala Ser Glu Lys Gly Ala Lys Lys Gly Lys Asn Ghi Leu Val He Pro Ala Asn Ala Thr Val Leu 
He Lys 

SEQIDNO: 175 

atgaaaaatataatacgactttgtgctgccagcgctatcctcacggtgtcccacgccagttacgccgacgcaattttacacgcgtttaactggcaat 

ataccgatgtaaccgccaatgcaaatcaaattgccgcaaatggctttaaaaaagtcctcatttcacccgcaatgaaatccagcggcagtcaatgg 

tgggcccgctatcaaccgcaagacttgcgtgtcattgattctccgctgggcaacaaacaagatttagtcgcgatgatcaatgcgctcaacagcgt 

tggggtcgacgtgtatgctgacgtggtgcttaaccatatggctaacgagtcatggaagcgcagtgacctgaactacccggggagtgaggtgct 

caacgactatcaatcccgcagtgcttactatcaaaggcaaacacttttcggcaatttacaggagaaccttttttccgagaatgatttccatccggca 

ggctgtattaccaattggaatgatcctggccacgtccagtattggcgcttgtgcggcggacagggcgatactgggctaccggatctcgatcc 

tcaatgggttgtgagtcagcagaagagttacttgaacgcactcaaatcaatgggaatcaaagggttccgtatcgatgcggtcaaacatatgagtc 

aatatcaaatagaccaagtgtttaccccagacaltaccgctggtatgcatatattcggagaagtcattaccagtggtgggcaaggtgatagcggct 

atgaggcttttcttgccccttaccttaataataccgatcacgccgcttatgacttcccgctatttgcatcgattcgagccgcgttttcattctc^^ 

gttaaatcagctacacaatccacaagcctatggccaagcgttacaggactcacgtgcgatcacctttacgattacccacgacattccaaccaatg 

acggtttccgctaccagatcatggatccaaccgatgaacagctcgcctatgcctacatcttgggcaaagatggaggaacgccacttgtctatagt 

gatgacctacctgacagcgaagacaaagacagtggtcgttgggccgatgtgtggcaagatccgaacatgattaacatgcttgccttccacaacg 

cgatgcaaggacaaagcatgactgtagtggctagcgatcaatgtaccttgctatttaagcgcggcaagcaaggcgtggtaggaatcaataaatg 

tggcgagagtaagtcggtgactgtcgatacttaccagcatgagtttaactggtacaccccgtaccaagacgtattgagcggcgacatcaccaca 

gtgagttctcgttatcaccaatttgttttgccagcgcgcagtgcaaggatgtggaaactataa 

SEQIDNO: 176 

Met Lys Asn He He Arg Leu Cys Ala Ala Ser Ala He Leu Thr Val Ser His Ala Ser Tyr Ala Asp Ala 
He Leu His Ala Phe Asn Tip Gin Tyr Thr Asp Val Thr Ala Asn Ala Asn Gin lie Ala Ala Asn Gly 
Phe Lys Lys Val Leu He Ser Pro Ala Met Lys Ser Ser Gly Ser Gin Trp Tip Ala Arg Tyr Gin Pro 
Gin Asp Leu Arg Val He Asp Ser Pro Leu Gly Asn Lys Gin Asp Leu Val Ala Met He Asn Ala Leu 
Asn Ser Val Gly Val Asp Val Tyr Ala Asp Val Val Leu Asn His Met Ala Asn Glu Ser Tip Lys Arg 
Ser Asp Leu Asn Tyr Pro Gly Ser Glu Val Leu Asn Asp Tyr Gin Ser Arg Ser Ala Tyr Tyr Gin Arg 
Gin Thr Leu Phe Gly Asn Leu Gin Glu Asn Leu Phe Ser Glu Asn Asp Phe His Pro Ala Gly Cys He 
Thr Asn Trp Asn Asp Pro Gly His Val Gin Tyr Trp Arg Leu Cys Gly Gly Ghi Gly Asp Thr Gly 
Leu Pro Asp Leu Asp Pro Asn Gin Tip Val Val Ser Gin Gin Lys Ser Tyr Leu Asn Ala Leu Lys Ser 
Met Gly He Lys Gly Phe Arg lie Asp Ala Val Lys His Met Ser Gin Tyr Gin He Asp Gin Val Phe 
Thr Pro Asp He Thr Ala Gly Met His He Phe Gly Glu Val lie Thr Ser Gly Gly Gin Gly Asp Ser Gly 
Tyr Glu Ala Phe Leu Ala Pro Tyr Leu Asn Asn Thr Asp His Ala Ala Tyr Asp Phe Pro Leu Phe Ala 
Ser He Arg Ala Ala Phe Ser Phe Ser Gly Gly Leu Asn Gin Leu His Asn Pro Gin Ala Tyr Gly Gin . 
Ala Leu Gin Asp Ser Arg Ala lie Thr Phe Thr He Thr His Asp He Pro Thr Asn Asp Gly Phe Arg 
Tyr Gin He Met Asp Pro Thr Asp Glu Gin Leu Ala Tyr Ala Tyr He Leu Gly Lys Asp Gly Gly Thr 
Pro Leu Val Tyr Ser Asp Asp Leu Pro Asp Ser Glu Asp Lys Asp Ser Gly Arg Trp Ala Asp Val Trp 
Gin Asp Pro Asn Met He Asn Met Leu Ala Phe His Asn Ala Met Gin Gly Gin Ser Met Thr Val Val 
Ala Ser Asp Gin Cys Thr Leu Leu Phe Lys Arg Gly Lys Gin Gly Val Val Gly He Asn Lys Cys Gly 
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GIu Ser Lys Ser Val Thr Val Asp Thr Tyr Gin His Glu Phe Asn Tip Tyr Thr Pro Tyr Gin Asp Val 
Leu Ser Gly Asp He Thr Thr Val Ser Ser Arg Tyr His Gin Phe Val Leu Pro Ala Arg Ser Ala Arg 
Met Tip Lys Leu 

SEQIDNO: 177 

atgaaaacattcaaattaaaacgcacttttttaccgctgaccttgctgctcagtgctcctgccmgctgggcaaaatggcaccatgatgcagtat^ 

cattggtacgtacctaatgatggcgcaUatggacgcaggttgaaagcaatgctccagtactcgctgaaaacggtttte 

gcccgcatacaaaggcgcgggcggcagtaatgacgtcggttatggcgtctatgatatgtacgatttaggtgagmgaccaaaa 

cgaaccaaatacggcaccaaggctcagtacatctctgcaatcaatgccgcgcacaacaacaatatccaaamacggcgacgttgtgtttaacca 

ccgaggtggcgctgatgggaagtcgtgggtcgataccaagcgcgUgattgggacaaccgcaatattgaactgggcgacaaatggattgaag 

cttgggttgagtttaattttcctggccgcaacgacaaatactcgaacttccattggacttggtatcacmgacggtgttgactgggatgat^^ 

aaagaaaaagcgatctttaaattcaaaggcgaaggaaaagcatgggattgggaagtcagctctgaaaaaggcaattacgactacctaatgtac 

gccgattUgacatggatcacccagaagttaaacaagagctgaaagattggggtgagtggtacatcaacatgaccggcgttgat^^^ 

tggatgccgtgaagcacattaaatatcagtatctacaagagtggattgatcatttacgttggaaaacaggcaaagagctmcaccg^^ 

ttggaattacgacgtaaatcaactgcacaactttattactaagacctctggcagtatgtcgttgttcgatgcgccgcttcacatgaatttctaca^^ 

gtcaaaatctggcggcacttacgatatgcgccaaatcatgaatggcacgttgatgaaggacaacccagtcaaagcagtgactctcgtagaaaac 

cacgatacgcagccattgcaggcgttagagtcgacagtagattggtggttcaagcctcttgcttacgcattcattttattgcgtgaagaaggte 

catcggtgttctacgcagattactacggcgcgcagtacagcgacaaaggttacaacattaatatggccaaagtgccttacattgaag 

cactgcgtaaagagtatgcg^atggcaaacagaattcttatctcgaccattgggatgtgattggctggacccgagagggcgatgctgaacatcc 

aaactcaatggcggtgatcatgagtgatggaccgggcggcacaaaatggatgtataccggtaagccaagtacgcgctatgtcgacaagctgg 

gtatccgaactgaagatgmggaccgatgccaatggctgggcagaatttcctg^caatggtggttcagtctcggmgggtgggcgttaagte^ 

SEQIDNO: 178 

Met Lys Thr Phe Lys Leu Lys Arg Thr Phe Leu Pro Leu Thr Leu Leu Leu Ser Ala Pro Ala Phe Ala 
Gly Gin Asn Gly Thr Met Met Gin Tyr Phe His Tip Tyr Val Pro Asn Asp Gly Ala Leu Tip Thr Gin 
Val Glu Ser Asn Ala Pro Val Leu Ala Glu Asn Gly Phe Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys 
Gly Ala Gly Gly Ser Asn Asp Val Gly Tyr Gly Val Tyr Asp Met Tyr Asp Leu Gly Glu Phe Asp 
Gin Lys Gly Ser Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Tyr He Ser Ala He Asn Ala Ala His 
Asn Asn Asn lie Gin He Tyr Gly Asp Val Val Phe Asn His Arg Gly Gly Ala Asp Gly Lys Ser Tip 
Val Asp Thr Lys Arg Val Asp Tip Asp Asn Arg Asn He Glu Leu Gly Asp Lys Tip lie Glu Ala Trp 
Val Glu Phe Asn Phe Pro Gly Arg Asn Asp Lys Tyr Ser Asn Phe His Trp Thr Trp Tyr His Phe Asp 
Gly Val Asp Trp Asp Asp Ala Gly Lys Glu Lys Ala He Phe Lys Phe Lys Gly Glu Gly Lys Ala Tip 
Asp Trp Glu Val Ser Ser Glu Lys Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp 
His Pro Glu Val Lys Gin Glu Leu Lys Asp Tip Gly Glu Tip Tyr He Asn Met Thr Gly Val Asp Gly 
Phe Arg Met Asp Ala Val Lys His He Lys Tyr Gin Tyr Leu Gin Glu Trp He Asp His Leu Arg Tip 
Lys Thr Gly Lys Glu Leu Phe Thr Val Gly Glu Tyr Trp Asn Tyr Asp Val Asn Gin Leu His Asn 
Phe lie Thr Lys Thr Ser Gly Ser Met Ser Leu Phe Asp Ala Pro Leu His Met Asn Phe Tyr Asn Ala 
Ser Lys Ser Gly Gly Thr Tyr Asp Met Arg Gin He Met Asn Gly Thr Leu Met Lys Asp Asn Pro Val 
Lys Ala Val Thr Leu Val Glu Asn His Asp TTir Gin Pro Leu Gin Ala Leu Glu Ser Thr Val Asp Tip 
Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Leu Arg Glu Glu Gly Tyr Pro Ser Val Phe Tyr Ala 
Asp Tyr Tyr Gly Ala Gbi Tyr Ser Asp Lys Gly Tyr Asn lie Asn Met Ala Lys Val Pro Tyr lie Glu 
Glu Leu Val Thr Leu Arg Lys Glu Tyr Ala Tyr Gly Lys Gin Asn Ser Tyr Leu Asp His Tip Asp Val 
He Gly Tip Thr Arg Glu Gly Asp Ala Glu His Pro Asn Ser Met Ala Val He Met Ser Asp Gly Pro 
Gly Gly Thr Lys Trp Met Tyr Thr Gly Lys Pro Ser Thr Arg Tyr Val Asp Lys Leu Gly lie Arg Thr 
Glu Asp Val Trp Thr Asp Ala Asn Gly Tip Ala Glu Phe Pro Val Asn Gly Gly Ser Val Ser Val Tip 
Val Gly Val Lys 

SEQIDNO: 179 

atgaaaacattcaaattaaaacgcacttttttaccgctaaccttgctgctcagtgctcctgcctttgccgggcaaaatggcaccatgatgcagtactt 
tcattggtacgtacctaatgatggcgcattatggacgcaggttgaaagcaatgctccagcactcgctgaaaacggttttacagcgctctggctacc 
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gccagcttacaaaggcgcgggcggcagtaatgatgtcggttatggcgtctacgatatgtacgatttaggtgagmgatcaaaaag 

gaaccaaatacggtaccaaggctcagtacatctctgcaatcaatgctgcgcacaacaacaatatccaaatttacggcgacgUg^ 

gtggtggcgctgatgggaagtcgtgggtcgataccaagcgcgttgattgggacaacxgtaacangaactgggcgaca 

gggttgagWaatmcctagccgcaacgacaaatactcgaacttccattggacttggtatcactttgacggtgttgactgggatgatgccggcaa 

agaaaaagcgatctttaaattcaaaggcgaaggaaaagcatgggattgggaagtcagctctgaaaaaggcaattacgactacctaatgta^^^ 

cgatttagacatggatcacccagaagttaaacaagagctgaaagattggggtgagtggtacatcaacatgaccggc^^ 

gatgccg^taagcacattaaatatcagtatctacaagagtggattgatcatttacgttggaaaacaggcaaagagctm^ 

gaattacgacgtaaatcaactgcataacttmttactaagacctctggcagtatgtcgUgttcgatgcgc^ 

caaaatctggcggcaattacgatatgcgccaaatcatgaatggcacgttgatgaaggacaacccagtcaaagc^^^ 

cgatacgcagccattgcaggcgttagagtcgacagtggattggtggttcaagcctcttgcttacgcattcatcttgttgcgtgaaga 

tcggtgUctacgcagattactacggcgcgcagtacagcgacaaaggttacaacattaatatggccaaagtgccttac^^^ 

ctgcgtaaagagtatgcgtatggcaaacagaattcttatctogaccangggatgtgatt^^ 

ctcaatggcggtgatcatgagtgatggaccgggcggaacaaaatggatgtataccggtaatccaagcacgcgctatgtcgacaagctgg 
ccgaactgaagatgmggaccgatgccaatggctgggcagaamcctgtcaatggtggttcagtctcggWgggtgggcgttaag^ 

SEQIDNO: 180 

Met Lys Thr Phe Lys Leu Lys Arg Thr Phe Leu Pro Leu Thr Leu Leu Leu Ser Ala Pro Ala Phe Ala 
Gly Gin Asn Gly Thr Met Met Gin Tyr Phe His Trp Tyr Val Pro Asn Asp Gly Ala Leu Trp Thr Gin 
Val Glu Ser Asn Ala Pro Ala Leu Ala Glu Asn Gly Phe Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys 
Gly Ala Gly Gly Ser Asn Asp Val Gly Tyr Gly Val Tyr Asp Met Tyr Asp Leu Gly Glu Phe Asp 
Ghi Lys Gly Ser Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Tyr lie Ser Ala He Asn Ala Ala His 
Asn Asn Asn He Gin He Tyr Gly Asp Val Val Phe Asn His Arg Gly Gly Ala Asp Gly Lys Ser Trp 
Val Asp Thr Lys Arg Val Asp Trp Asp Asn Arg Asn lie Glu Leu Gly Asp Lys Trp He Glu Ala Trp 
Val Glu Phe Asn Phe Pro Ser Arg Asn Asp Lys Tyr Ser Asn Phe His Trp Thr Tip Tyr His Phe Asp 
Gly Val Asp Trp Asp Asp Ala Gly Lys Glu Lys Ala He Phe Lys Phe Lys Gly Glu Gly Lys Ala Tip 
Asp Tip Glu Val Ser Ser Glu Lys Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp 
His Pro Glu Val Lys Gin Glu Leu Lys Asp Trp Gly Glu Trp Tyr He Asn Met Thr Gly Val Asp Gly 
Phe Arg Met Asp Ala Val Lys His He Lys Tyr Gin Tyr Leu Gin Glu Trp He Asp His Leu Arg Trp 
Lys Thr Gly Lys Glu Leu Phe Thr Val Gly Glu Tyr Tip Asn Tyr Asp Val Asn Gin Leu His Asn 
Phe He Thr Lys Thr Ser Gly Ser Met Ser Leu Phe Asp Ala Pro Leu His Met Asn Phe Tyr Asn Ala 
Ser Lys Ser Gly Gly Asn Tyr Asp Met Arg Gta lie Met Asn Gly Thr Leu Met Lys Asp Asn Pro Val 
Lys Ala Val Thr Leu Val Glu Asn His Asp Thr Gto Pro Leu Gin Ala Leu Glu Ser Thr Val Asp Tip 
Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Leu Arg Glu Glu Gly Tyr Pro Ser Val Phe Tyr Ala 
Asp Tyr Tyr Gly Ala Ghi Tyr Ser Asp Lys Gly Tyr Asn lie Asn Met Ala Lys Val Pro Tyr He Glu 
Glu Leu Val Thr Leu Arg Lys Glu Tyr Ala Tyr Gly Lys Gbi Asn Ser Tyr Leu Asp His Trp Asp Val 
He Gly Trp Thr Arg Glu Gly Asp Ala Glu His Pro Asn Ser Met Ala Val He Met Ser Asp Gly Pro 
Gly Gly Thr Lys Trp Met Tyr Thr Gly Asn Pro Ser Thr Arg Tyr Val Asp Lys Leu Gly He Arg Thr 
Glu Asp Val Tip Thr Asp Ala Asn Gly Trp Ala Glu Phe Pro Val Asn Gly Gly Ser Val Ser Val Tip 
Val Gly Val Lys 

SEQIDNO: 181 

ttgccagaggccttcggcctggccattacgccgtcacatagccggcgggggaggttggtgggcgtgtcgcgcgggggcagcctgccgatgc 

cggtcctccactggccggcgttcatcctcgtccggcgcttcgtcgccggtcatccgaacaagcacaagaaccggagtattgcgatgagccaca 

ccctgcgtgccgccgtattggcggcgatcctgctgccgttccccgccx:tcgctgaccaggccggcaagagcccggccggcgtgcgct^^ 

cggcggcgacgaaatcatcctccagggcttccactggaacgtcgtccgcgaagcgcccaacgactggtacaacatccttcgccagcaggcct 

cgacgatcgccgcggacggcttctcggcaatctggatgccggtgccctggcgtgacttctccagctggaccgacggcggcaagtcaggcgg 

cggcgaaggctacttctggcacgacttcaacaagaacggccgctacggcagcgacgcccagctgcgccaggccgccggcgcactcggtgg 

cg<^cggggtg^^aggtgctctacgatgtggtgcccaatcacatgaaccgcggctatccggacaaggagatcaacctgccggccggccagggc 

ttctggcgcaacgactgcaccgacccgggcaactaccccaacgactgcgatgacggtgaccgcttcatcggcggcaagtcggacctgaaca 

ccggccatccgcagatctacggcatgtttcgcgacgagcttgccaacctgcgcagcgggtacggcgccggcggcttccgc^^ 
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gcggctatgcgcccgaacgggtcgacagctggatgagcgacagcgccgacagcagtttctgcgttggcgagctgtggaaaagcccgtccga 

gtacccgagctgggactggcgcaacacggcgagctggcagcagatcatcaaggactggtccgaccgggccaagtgcccg^ 

gcgctcaaggagcgcatgcagaacggctcggtcgccgactggaagcatggcctcaatggcaacccggac^^ 

ggtgacctttgtcgacaaccacgacaccggctattcgcccgggcagaacggcggccagcaccactgggcgctgcaggacgggctgatccg 

ccaggcctacgcctacatcctcaccagcccgggcacgccggtggtgtactggtcgcacatgtacgactggggctacggcgacUc^ 

gctgatccaggtgcggcgcaccgctggcgtgcgcgccgattcggcgatcagcttccacagcggctacagcggcctggtcgcta^ 

ggcagccatcagaccctggtggtggcgctcaactccgatctggccaaccccggccaggtcgccagcggcagcttcagcgaggcggtcaac 

gccagcaacggccaggtgcgcgtctggcgcagcggtagcggcgatggcggcggcaatgacggcggcgagggcggtctggtcaa^^^^ 

cttccgctgcgacaacggcgtgacgcagatgggcgacagcgtctacgcggtgggcaacgtcagccagctcggcaact^^ 

cgcggtacggctgaccgacaccagcagctatccgacctggaagggcagcatcgccctgcctgacggtcagaacgtggaatggaagtgcct^ 

atccgtaacgaggcggacgcgacgctggtgcgccagtggcaatcgggcggcaacaaccaggtccaggccgctgccggcgcgagcacca 

gcggctcgttctga 

SEQIDNO:182 

Met Pro Glu Ala Phe Gly Leu Ala He Thr Pro Ser His Ser Arg Arg Gly Arg Leu Val Gly Val Set 
Arg Gly Gly Ser Leu Pro Met Pro Val Leu His Tip Pro Ala Phe He Leu Val Arg Arg Phe Val Ala 
Gly His Pro Asn Lys His Lys Asn Arg Ser He Ala Met Ser His Thr Leu Arg Ala Ala Val Leu Ala 
Ala He Leu Leu Pro Phe Pro Ala Leu Ala Asp Gin Ala Gly Lys Ser Pro Ala Gly Val Arg Tyr His 
Gly Gly Asp Glu He He Leu Gin Gly Phe His Tip Asn Val Val Arg Glu Ala Pro Asn Asp Trp Tyr 
Asn He Leu Arg Gin Gin Ala Ser Thr He Ala Ala Asp Gly Phe Ser Ala He Trp Met Pro Val Pro Tip 
Arg Asp Phe Ser Ser Trp Thr Asp Gly Gly Lys Ser Gly Gly Gly Glu Gly Tyr Phe Tip His Asp Phe 
Asn Lys Asn Gly Arg Tyr Gly Ser Asp Ala Ghi Leu Arg Gin Ala Ala Gly Ala Leu Gly Gly Ala 
Gly Val Lys Val Leu Tyr Asp Val Val Pro Asn His Met Asn Arg Gly Tyr Pro Asp Lys Glu lie Asn 
Leu Pro Ala Gly Gin Gly Phe Tip Arg Asn Asp Cys Thr Asp Pro Gly Asn Tyr Pro Asn Asp Cys 
Asp Asp Gly Asp Arg Phe He Gly Gly Lys Ser Asp Leu Asn Thr Gly His Pro Gin He Tyr Gly Met 
Phe Arg Asp Glu Leu Ala Asn Leu Arg Ser Gly Tyr Gly Ala Gly Gly Phe Arg Phe Asp Phe Val 
Arg Gly Tyr Ala Pro Glu Arg Val Asp Ser Tip Met Ser Asp Ser Ala Asp Ser Ser Phe Cys Val Gly 
Glu Leu Trp Lys Ser Pro Ser Glu Tyr Pro Ser Tip Asp Tip Arg Asn Thr Ala Ser Trp Gin Gin He He 
Lys Asp Trp Ser Asp Arg Ala Lys Cys Pro Val Phe Asp Phe Ala Leu Lys Glu Arg Met Gin Asn 
Gly Ser Val Ala Asp Tip Lys His Gly Leu Asn Gly Asn Pro Asp Pro Arg Tip Arg Glu Val Ala Val 
Thr Phe Val Asp Asn His Asp Thr Gly Tyr Ser Pro Gly Gin Asn Gly Gly Gin His His Tip Ala Leu 
Gin Asp Gly Leu He Arg Gin Ala Tyr Ala Tyr He Leu Thr Ser Pro Gly Thr Pro Val Val Tyr Tip Ser 
His Met Tyr Asp Tip Gly Tyr Gly Asp Phe He Arg Gin Leu He Gin Val Arg Arg Thr Ala Gly Val 
Arg Ala Asp Ser Ala He Ser Phe His Ser Gly Tyr Ser Gly Leu Val Ala Thr Val Ser Gly Ser His Gin 
Thr Leu Val Val Ala Leu Asn Ser Asp Leu Ala Asn Pro Gly Gin Val Ala Ser Gly Ser Phe Ser Glu 
Ala Val Asn Ala Ser Asn Gly Gin Val Arg Val Trp Arg Ser Gly Ser Gly Asp Gly Gly Gly Asn Asp 
Gly Gly Glu Gly Gly Leu Val Asn Val Asn Phe Arg Cys Asp Asn Gly Val Thr Gin Met Gly Asp 
Ser Val Tyr Ala Val Gly Asn Val Ser Gin Leu Gly Asn Trp Ser Pro Ala Ser Ala Val Arg Leu Thr 
Asp Thr Ser Ser Tyr Pro Thr Trp Lys Gly Ser He Ala Leu Pro Asp Gly Gin Asn Val Glu Trp Lys 
Cys Leu He Arg Asn Glu Ala Asp Ala Thr Leu Val Arg Gin Trp Gin Ser Gly Gly Asn Asn Gin Val 
Gin Ala Ala Ala Gly Ala Ser Thr Ser Gly Ser Phe 

SEQroNO: 183 

atgcaaacgattgcaaaaaaaggggatgaaacga^aaagggaaaaaatggacagctttagctctaacactgccgctggctgctagcttatca 

acaggcgttcacgccgaaaccgtacataaaggtaagtctgaagcaacagataaaaacggtgtcttttatgaggtgtatgtaaactcttttta 

caaataaagatggacatggtgatttaaaaggtctgacacaaaagttggattatttaaatgacggcaattctcatacaaagaatgatcttcaagtaaa 

cgggatttggatgatgccagtcaacccttctcctagctatcataaatatgatgtaacggactattataacattgatcctca^^ 

attttcgcaagctgatgaaagaagcagacaaacgagacgtaaaagtcattatggaccttgttgtgaatcatacgagcagcgaacacccttggttt 

caagctgcattaaaagataaaaacagcaagtacagagattactatatttgggctgataaaaataccgatttgaatgaaaaaggatcttgggggca 

gcaagtatggcataaagctccaaacggagagtatttttacggaacgttttgggaaggaatgcctgacttaaattacgataaccctg^ 
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aagaaatgattaacgtcggaaagtmggctaaagcaaggcgttaatggcttccgcttagatgctgcgcttcatattt^ 
aggcgctaagaaaaatatcctgtggtggaatgagtttagagatgcgatgaaaaaagaaaaccctaacgtatatc^^ 
cagcctgaagtggtagctccttactatoaatcgcngattctttatttaatmgatttagcaggmaatt^ 

aaggaatcgccactgcagcagcggcaacagatgaactgttcaaatcatacaatccaaataaaattgacggcatmcttaaccaaccatgaccaa 
aategcgtcatgagtgagctgagcggcgatgtgaacaaagcaaaatcagctgcttctatcttacttacgcncctggcaacccgto 



gggaaacacctgtatataacaaaggcggcaacggcgtgtctgtagaagtacaaaccaaacaaaaggattcmgttaaatcato^ 
attcgcgtgcgtcagcagcatgaagagttag^aaaaggaacgcttcaatctatttcagtagacagtaaagaagtggttgcctata^^ 



tgctaaaaaagtcaaaaatca^t^attccggctaatacaacggttttaataaaataa 
SEQIDNO:184 

Met Gin Thr Ee Ala Lys Lys Gly Asp Glu Thr Met Lys Gly Lys Lys Trp Thr Ala Leu Ala Leu Thr 
Leu Pro Leu Ala Ala Ser Leu Ser Thr Gly Val His Ala Glu Thr Val His Lys Gly Lys Ser Glu Ala 
Thr Asp Lys Asn Gly Val Phe Tyr Glu Val Tyr Val Asn Ser Phe Tyr Asp Thr Asn Lys Asp Gly His 
Gly Asp Leu Lys Gly Leu Thr Gin Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn 
Asp Leu Gin Val Asn Gly He Trp Met Met Pro Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val 
Thr Asp Tyr Tyr Asn He Asp Pro Gin Tyr Gly Asn Leu Gbi Asp Phe Arg Lys Leu Met Lys Glu Ala 
Asp Lys Arg Asp Val Lys Val He Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Trp Phe 
Gin Ala Ala Leu Lys Asp Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp 
Leu Asn Glu Lys Gly Ser Trp Gly Gin Gto Val Trp His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly 
Thr Phe Trp Glu Gly Met Pro Asp Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met He Asn Val 
Gly Lys Phe Trp Leu Lys Gin Gly Val Asn Gly Phe Arg Leu Asp Ala Ala Leu His He Phe Lys Gly 
Gin Thr Pro Glu Gly Ala Lys Lys Asn He Leu Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys Glu 
Asn Pro Asn Val Tyr Leu Thr Gly Glu Val Trp Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin Ser 
Leu Asp Ser Leu Phe Asn Phe Asp Leu Ala Gly Lys He Val Ser Ser Val Lys Ala Gly Asn Asp Gin 
Gly He Ala Thr Ala Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp Gly 
He Phe Leu Thr Asn His Asp Ghi Asn Arg Val Met Ser Glu Leu Ser Gly Asp Val Asn Lys Ala Lys 
Ser Ala Ala Ser He Leu Leu Thr Leu Pro Gly Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met Thr 
Gly Glu Lys Pro Asp Glu Leu He Arg Glu Pro Phe Arg Trp Tyr Glu Gly Asn Gly Leu Gly Gin Thr 
Ser Trp Glu Thr Pro Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Val Gin Thr Lys Gin Lys 
Asp Ser Leu Leu Asn His Tyr Arg Glu Met He Arg Val Arg Gto Gin His Glu Glu Leu Val Lys Gly 
Thr Leu Ghi Ser He Ser Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Asn Ser He 
Ser Val Tyr His Asn He Ser Asn Gin Pro Val Lys Val Ser Val Ala Ala Lys Gly Lys Leu He Phe Ala 
Ser Glu Lys Gly Ala Lys Lys Val Lys Asn Gin Leu Val He Pro Ala Asn Thr Thr Val Leu He Lys 

SEQIDNO:185 

atgaaactgatgaaagggaaaaaatggacagctttagctctaacactgccgctggctgctagcttatcaacaggcgttcacgccgaaactg^ 

ataaaggtaaagctccaacagcagataaaaacggtgtcttttatgaggtgtatgtaaactctttttacgatgcaaataaagatggacatg^ 

aaaggtcttacacaaaagctggactatttaaatgacggaaattctcatacaaagaatgatcttcaagtaaacgggatttggatgatg 



agacaaacgagacgtaaaagtcattatggaccttgttgtgaatcatacgagcagcgaacacccttggtttcaagctgcgttaaaagataaaaaca 
gcaagtacagagattactatatttgggctgataaaaataccgacttgaatgaaaaaggatcttggggacagcaagtatggcataaagctccaaac 



ggctaaagcaaggcgttgatggcttccgcttagatgctgcgcttcatatttttaaaggtcaaacgcctgaaggcgctaagaaaaatattctg^^ 

ggaatgagtttagagatgcgatgaaaaaagaaaaccctaacgtatatctaacgggtgaagtatgggatcagcctgaagtggtagctccttactat 

caatcgcttgattccctatttaactttgatttagcagggaaaattgtcagttctgtaaaagcaggaaatgatcaagga^^ 

caacggatgagctgttcaaatcatacaatccaaataaaattgacggcatmcttaaccaaccatgaccaaaaccgcgtcatgagtgaa 

gcgatgtgaacaaagcaaaatcagctgcttctatcttacttacgcttcctggcaacccgtatatttattacggtgaagaaattg^^ 
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cggcaacggcgtgtctgtagaagcacaaaccaaacaaaaggattcmgttaaatcattaccgtgaaatgattcgcgtgc 
agttagtaaaaggaacgcttcaatctattttagtagacagtaaagaagttgttgcctatagccgtacgtataaaga 
aalattteaaatcaaccggtaaaagtatctgtagcagcaaaaggtaaattaattmgcta^ 
tgattccggctaatacaacggttttaataaaataa 

SEQIDNO:186 

Met Lys Leu Met Lys Gly Lys Lys Trp Thr Ala Leu Ala Leu Thr Leu Pro Leu Ala Ala Ser Leu Ser 
Thr Gly Val His Ala Glu Thr Val His Lys Gly Lys Ala Pro Thr Ala Asp Lys Asn Gly Val Phe Tyr 
Glu Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His Gly Asp Leu Lys Gly Leu Thr 
Gin Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn Asp Leu Gin Val Asn Gly He Trp 
Met Met Pro Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val Thr Asp Tyr Tyr Asn He Asp Pro 
Gin Tyr Gly Asn Leu Gin Asp Phe Arg Lys Leu Met Lys Glu Ala Asp Lys Arg Asp Val Lys Val 
He Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Trp Phe Gin Ala Ala Leu Lys Asp Lys 
Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp Leu Asn Glu Lys Gly Ser Trp 
Gly Gin Ghi Val Trp His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly Thr Phe Trp Glu Gly Met Pro 
Asp Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met He Asn Val Gly Lys Phe Trp Leu Lys 
Gin Gly Val Asp Gly Phe Arg Leu Asp Ala Ala Leu His He Phe Lys Gly Gin Thr Pro Glu Gly Ala 
Lys Lys Asn He Leu Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys Glu Asn Pro Asn Val Tyr Leu 
Thr Gly Glu Val Trp Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin Ser Leu Asp Ser Leu Phe Asn 
Phe Asp Leu Ala Gly Lys He Val Ser Ser Val Lys Ala Gly Asn Asp Ghi Gly He Ala Thr Ala Ala 
Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp Gly He Phe Leu Thr Asn His 
Asp Gin Asn Arg Val Met Ser Glu Leu He Gly Asp Val Asn Lys Ala Lys Ser Ala Ala Ser He Leu 
Leu Thr Leu Pro Gly Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met Thr Gly Glu Lys Pro Asp 
Glu Leu He Arg Glu Pro Phe Arg Trp Tyr Glu Gly Asn Gly Leu Gly Gin Thr Ser Trp Glu Thr Pro 
Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Ala Gin Thr Lys Gin Lys Asp Ser Leu Leu Asn 
His Tyr Arg Glu Met He Arg Val Arg Gb Gin His Glu Glu Leu Val Lys Gly Thr Leu Gin Ser He 
Leu Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Asp Asn Ser He Ser Val Tyr His 
Asn He Ser Asn Gin Pro Val Lys Val Ser Val Ala Ala Lys Gly Lys Leu He Phe Ala Ser Glu Lys 
Gly Ala Lys Lys Val Lys Asn Gta Leu Val He Pro Ala Asn Thr Thr Val Leu He Lys 

SEQ ID NO: 187 

ttgtatctcatccaggaggggcacatgcgttttccgcccattattcacccgcttaccggcctggccgttccggttggagctctgcgtaccgc^^ 

agctgcggcataggggagtttgccgacttgccggttcttgccgaattctgcaaaaaagccggatttgatcttgtacagcttcttccggtcaatga^ 

accggcacagaaagttctccatacagcgcgctttctgcctttgccctgcacccgctgtatatcaggctttccgacctgcctgaagcagcgggtttc 

gaaaagcagattacagatctgaaaagccggtttgaggacttgcctcgtttcagctatacggagctgcgccgtgccaaactggatatcctgcgtgc 

agtgtttgataaaaacaaggcaaccatcatcggcagtgccgaactggaagcctggatttcagataacccctggatcatcgaatatgcgg^^ 

gaaccagaaacaccgcaactttgaagccggctggaaacattgggaaaagctgcgcaaccccactcataacgaaatacaaaaaacctggcag 

ggtaaaacctggcaggctgaccatcaattctttgcatggctgcagatgcggctggaccagcagtttactgccgccgctacagagtgcaacgcc 

ctgggtgtctatcttaagggcgatatacctataatgatgaacgaggattccgcagatgcctgggcgaatccggaattcttccgtgacgatcto 

gccggaagtccccctgacggtgaaaacccccagggacaaaactggggcttccccatttataactgggaaaaccttgcaaatgacgggtac^^ 

ctggtggaaaaaacgtctgaagcacagcgcacggtattaccatgcctaccgcattgaccatattcttgggtttttccggatatgggc 

ggcgaatactccggctacctgggatggcccttgccgcatgaaccggtaagtgcagcagaactggcagaacggggcttttccaaggaccgctt 

gcgctggcttaccgaaccccacttgcctacacgggcagccgaggaagcgaataactgggactatctgggaacacacggctatctgaatc 

tcatgaaccgtatcggtgaagaagaactatggctgttcaagcccgagatcacctgcgaggcagatatacgaaacacaaacctgccggatgccc 

tgaaagaggttctggtacggcagtggaaaaaccggctgctgcaggttaccggccgcgacgaaaaaggacggacaatctactatccgctgtgg 

cgtttccgtgacagcactgcatggcagacgcttaccgatggcgagaaacactccctggaagagctgttcgcccaaaaagcggcgcacaatga 

aaccctgtggcgagaacaggcggtggaacttctgggtgagctgacgcgatctacggatatgcttgcctgtgctgaagatctgggaag^ 

cacagtgtaccggaagtgctttcaaacctttcaatttacagtctgcgggttacccgctgggcccgccaatgggatgcccccggccagccctttca 

cagactggaggagtatccgctcatgtcggtagcgaccccatcggttcatgattcctctaccctgcgcggatggtgggaaaccgaaggcggcga 

ccgggcctttatggacgcatggcctccggaacaggatgcatacgcaggagcaggccgccatgagttcgaaggcgcctggggaccccgcca 
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ggcatcctgggtactccgtaaactctgcgaagcccgttccgcgctctgtgtmccccatGcaggatatmggccctgtcttcagactt^ 
gacagcggacgaggaacgcatcaatattccgggcagtgtatccggatttaactggacataccggngcctgcggcaatcgagga^ 
aacagccaacttataaccgcaatccagaccgcgttgcaggaccgccgggcgaggaaggcacaaggagcac^^ 

SEQIDNO:188 

Met Tyr Leu lie Gin Glu Gly His Met Arg Phe Pro Pro He He His Pro Leu Thr Gly Leu Ala Val Pro 

Val Gly Ala Leu Arg Thr Ala Gin Ser Cys Gly He Gly Glu Phe Ala Asp Leu Pro Val Leu Ala Glu 
Phe Cys Lys Lys Ala Gly Phe Asp Leu Val Gin Leu Uu Pro Val Asn Asp Thr Gly Thr Glu Ser Ser 
Pro Tyr Ser Ala Leu Ser Ala Phe Ala Leu His Pro Leu Tyr He Arg Leu Ser Asp Leu Pro Glu Ala 
Ala Gly Phe Glu Lys Gin He Thr Asp Leu Lys Ser Arg Phe Glu Asp Leu Pro Arg Phe Ser Tyr Thr 
Glu Leu Arg Arg Ala Lys Leu Asp He Leu Arg Ala Val Phe Asp Lys Asn Lys Ala Thr He He Gly 
Ser Ala Glu Leu Glu Ala Trp He Ser Asp Asn Pro Tip He He Glu Tyr Ala Val Phe Met Asn Gin 
Lys His Arg Asn Phe Glu Ala Gly Trp Lys His Trp Glu Lys Leu Arg Asn Pro Thr His Asn Glu He 
Gin Lys Thr Tip Gin Gly Lys Thr Tip Gbi Ala Asp His Gin Phe Phe Ala Tip Leu Ghi Met Arg Leu 
Asp Gin Gin Phe Thr Ala Ala Ala Thr Glu Cys Asn Ala Leu Gly Val Tyr Leu Lys Gly Asp He Pro 
He Met Met Asn Glu Asp Ser Ala Asp Ala Trp Ala Asn Pro Glu Phe Phe Arg Asp Asp Leu Arg 
Ala Gly Ser Pro Pro Asp Gly Glu Asn Pro Gin Gly Gin Asn Trp Gly Phe Pro He Tyr Asn Trp Glu 
Asn Leu Ala Asn Asp Gly Tyr Ser Trp Trp Lys Lys Arg Leu Lys His Ser Ala Arg Tyr Tyr His Ala 
Tyr Arg He Asp His He Leu Gly Phe Phe Arg He Trp Ala He Pro Tyr Gly Glu Tyr Ser Gly Tyr Leu 
Gly Trp Pro Leu Pro His Glu Pro Val Ser Ala Ala Glu Leu Ala Glu Arg Gly Phe Ser Lys Asp Arg 
Leu Arg Tip Leu Thr Glu Pro His Leu Pro Thr Arg Ala Ala Glu Glu Ala Asn Asn Trp Asp Tyr Leu 
Gly Thr His Gly Tyr Leu Asn Gin He Met Asn Arg He Gly Glu Glu Glu Leu Tip Leu Phe Lys Pro 
Glu He Thr Cys Glu Ala Asp He Arg Asn Thr Asn Leu Pro Asp Ala Leu Lys Glu Val Leu Val Arg 
Gin Trp Lys Asn Arg Leu Leu Gin Val Thr Gly Arg Asp Glu Lys Gly Arg Thr He Tyr Tyr Pro Leu 
Trp Arg Phe Arg Asp Ser Thr Ala Trp Ghi Thr Leu Thr Asp Gly Glu Lys His Ser Leu Glu Glu Leu 
Phe Ala Gin Lys Ala Ala His Asn Glu Thr Leu Trp Arg Glu Ghi Ala Val Glu Leu Leu Gly Glu Leu 
Thr Arg Ser Thr Asp Met Leu Ala Cys Ala Glu Asp Leu Gly Ser He Pro His Ser Val Pro Glu Val 
Leu Ser Asn Leu Ser lie Tyr Ser Leu Arg Val Thr Arg Trp Ala Arg Gin Trp Asp Ala Pro Gly Gin 
Pro Phe His Arg Leu Glu Glu Tyr Pro Leu Met Ser Val Ala Thr Pro Ser Val His Asp Ser Ser Thr 
Leu Arg Gly Trp Trp Glu Thr Glu Gly Gly Asp Arg Ala Phe Met Asp Ala Trp Pro Pro Glu Gin 
Asp Ala Tyr Ala Gly Ala Gly Arg His Glu Phe Glu Gly Ala Trp Gly Pro Arg Gin Ala Ser Trp Val 
Leu Arg Lys Leu Cys Glu Ala Arg Ser Ala Leu Cys Val Phe Pro He Gbi Asp He Leu Ala Leu Ser 
Ser Asp Phe Tyr Ala Met Thr Ala Asp Glu Glu Arg He Asn He Pro Gly Ser Val Ser Gly Phe Asn 
Tip Thr Tyr Arg Leu Pro Ala Ala He Glu Asp Leu Ser Lys Asn Ser Gin Leu He Thr Ala He Gin Thr 
Ala Leu Gin Asp Arg Arg Ala Arg Lys Ala Gin Gly Ala Gin Gin 

SEQroNO:189 

atgcaaacgattgcaaaaaaaggggatgaaacgatgaaagggaaaaaatggacagctttagctctaacactgccgctggctgctagcttatca 

acaggcgttcacgccgaaaccgtacataaaggtaaatctccagctgcagataaaaacggtgtcttttatgaggtgtatgtaaactcttttte 

caaataaagatggacatggtgatttaaaaggtcttacacaaaaactggactatttaaatgatggcaattctcatacaaagaatgatcttcaagtaaa 

cgggatttggatgatgccgatcaacccttctcctagctatcataaatatgatgtaacggactattataacattgattctcagtacggaaatctgcaag 

attttcgcaagctaatgaaagaagcagataaacgagatgtaaaagttattatggacctcgttgtgaatcatacgagcagtgaacacccttggtttca 

agctgcgttaaaagataaaaacagcaagtacagagattactatatttgggctgataaaaataccgatttgaatgaaaaaggatcttggggacaac 

aagtatggcacaaagctccaaacggagagtatttttacggaacgttctgggaaggaatgcctgacttaaattacgataac^^ 

gaaatgattaacgtcggaaagttttggctaaagcaaggcgttgacggcttccgcttagatgctgcccttcatatctttaaaggtcaaacacctgaa 

ggcgctaagaaaaatattgtgtggtggaatgaatttagagatgcgatgaaaaaagaaaacccgaacgtatatctaacgggcgaagtatgggatc 

agccggaagtggtagctccttattatcagtcgcttgattccctatttaactttgatttagcaggaaaaattgtcagctctgtaaaagca 

aaggaatcgctactgcagcagcggcaacagatgaactgttcaaatcatacaatccaaataaaattgacggcattttcttaaccaatcatgaccaaa 

atcgcgtcatgagtgagttaagcggagatgtcaataaagcaaagtcagctgcctctatcttacttacgcttcctggaaatccgtatatttattacggt 

gaagaaatcggcatgaccggtgaaaagcctgatgaattaatccgtgaaccgttccgctggtacgaaggaaacggacttggacaaacta^ 
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gaaacacctgtatacaataaaggcggcaacggcgtgtctgtagaagcacaaaccaaacaaaaggactcmgttaaatca 
tcgcgtgcgtcagcagcacgaagagttagtaaaaggaacgcttcaatctatttcagtagacagtaaagaagttgttgcttatagccgtacgtataa 
aggcaactccattagtgtgtatcataatatttcaaatcaacctgtaaaagtatctgtagcagcgaaaggtaaattgatttUgcto^ 
ctaaaaaggtcaaaaatcagcttgtgattccggcgaatacaacggttttagtaaaataa 

SEQIDNO: 190 

Met Gin Thr lie Ala Lys Lys Gly Asp Glu Thr Met Lys Gly Lys Lys Tip Thr Ala Leu Ala Leu Thr 
Leu Pro Leu Ala Ala Ser Leu Ser Thr Gly Val His Ala Glu Thr Val His Lys Gly Lys Ser Pro Ala 
Ala Asp Lys Asn Gly Val Phe Tyr Glu Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His 
Gly Asp Leu Lys Gly Leu Thr Gin Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn 
Asp Leu Gin Val Asn Gly He Tip Met Met Pro He Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val 
Thr Asp Tyr Tyr Asn He Asp Ser Gin Tyr Gly Asn Leu Gin Asp Phe Arg Lys Leu Met Lys Glu Ala 
Asp Lys Arg Asp Val Lys Val lie Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Tip Phe 
Gin Ala Ala Leu Lys Asp Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp 
Leu Asn Glu Lys Gly Ser Trp Gly Gin Gin Val Trp His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly 
Thr Phe Trp Glu Gly Met Pro Asp Leu Asn Tyr Asp Asn Pro Glu Val Arg Lys Glu Met He Asn Val 
Gly Lys Phe Trp Leu Lys Gin Gly Val Asp Gly Phe Arg Leu Asp Ala Ala Leu His He Phe Lys Gly 
Gin Thr Pro Glu Gly Ala Lys Lys Asn He Val Trp Tip Asn Glu Phe Arg Asp Ala Met Lys Lys Glu 
Asn Pro Asn Val Tyr Leu Thr Gly Glu Val Trp Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin Ser 
Leu Asp Ser Leu Phe Asn Phe Asp Leu Ala Gly Lys He Val Ser Ser Val Lys Ala Gly Asn Asp Gin 
Gly He Ala Thr Ala Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp Gly 
He Phe Leu Thr Asn His Asp Gin Asn Arg Val Met Ser Glu Leu Ser Gly Asp Val Asn Lys Ala Lys 
Ser Ala Ala Ser He Leu Leu Thr Leu Pro Gly Asn Pro Tyr He Tyr Tyr Gly Glu Glu He Gly Met Thr 
Gly Glu Lys Pro Asp Glu Leu He Arg Glu Pro Phe Arg Trp Tyr Glu Gly Asn Gly Leu Gly Gin Thr 
Ser Trp Glu Thr Pro Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Ala Gin Thr Lys Gin Lys 
Asp Ser Leu Leu Asn His Tyr Arg Glu Met He Arg Val Arg Gin Gin His Glu Glu Leu Val Lys Gly 
Thr Leu Gin Ser He Ser Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Asn Ser He 
Ser Val Tyr His Asn He Ser Asn Gin Pro Val Lys Val Ser Val Ala Ala Lys Gly Lys Leu He Phe Ala 
Ser Glu Lys Gly Ala Lys Lys Val Lys Asn Gin Leu Val He Pro Ala Asn Thr Thr Val Leu Val Lys 

SEQIDNO: 191 

atgcaaacgattgcaaaaaaaggggatgaaacgatgaaagggaaaaaatggacagctttagctctaacactgccgctggctgctagcttatca 

acaggcgttcacgccgaaaccgtacataaaggtaaatctccaacagcagataaaaacggtgtcttttatgaagtgtatgtaaactctttttacgatg 

caaataaagatggacatggtgacttaaaaggtcttacacaaaagttggactatttaaatgacggcaattctcatacaaaaaatgatcttcaagtaaa 

cgggatttggatgatgccagtcaacccttctcctagctatcataaatatgatgtaacggactattataacattgatccgcagtacggaaatctgcaa 

gattttcgcaagctgatgaaagaagcagacaaacgagacgtaaaagtcattatggaccttgttgtgaatcatacgagcagtgaacacccttggttt 

caagctgcgttaaaagataaaaacagcaagtacagagattactatatttgggctgataaaaataccgacttgaatgaaaaaggatcttggggaca 

acaagtatggcataaagctccaaacggagagtatttttacggaacgttctgggaaggaatgcctgacttaaattacgataaccctgaagtaagaa 

aagaaatgattaacgtcggaaagttttggctaaagcaaggcgttgacgggttccgcttagatgctgcgcttcatatttttaaaggtcaaacagctga 

aggcgctaagaaaaatatcctgtggtggaatgagtttagagatgcgatgaaaaaagaaaatccgaatgtatatctaacgggtgaagtatgggat 

cagcctgaagtggtagctccttattatcaatcgcttgattctttatttaattttgatttagcaggaaaaattgtcagctctgtaaaagcaggaaatg^^ 

aaggaatcgccactgcagcagcagcaacagatgaactgttcaaatcatacaatccaaacaaaattgatggcatattcttaaccaaccatgaccaa 

aatcgcgtcatgagtgagctgagcggcgatgtgagcaaagcaaaatcagctgcttctatcttacttacgcttcctggcaacccgtatatttattacg 

gtgaagaaatcggcatgaccggtgaaaagcctgatgaattaatccgtgaaccgttccgctggtacgaaggaaacggacttggacaaaccagtt 

gggaaacacctgtatacaataaaggcggaaacggtgtgtctgtagaagcacaaaccaaacaaaaggattctttgttaaatcattaccgtgaaatg 

attcgcgtgcgtcagcagcatgaagagttagtaaaaggaacgcttcaatctatttcagtagacagtaaagaagttgttgcttatagccgtacgtata 

aaggcaactccattagtgtgtatcataatatttcaaatcaaccggtaaaagtatctgtagcagcgaaaggtaaattgatttttgctagtgaaaaaggt 

gctaagaaagtcaaaaatcagcttgtggttccggcgaatacaacggttttaatgaaataa 
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Met Gin Thr lie Ala Lys Lys Gly Asp Glu Thr Met Lys Gly Lys Lys Trp Thr Ala Leu Ala Leu Thr 
Leu Pro Leu Ala Ala Ser Leu Set Thr Gly Val His Ala Glu Thr Val His Lys Gly Lys Ser Pro Thr 
Ala Asp Lys Asn Gly Val Phe Tyr Glu Val Tyr Val Asn Ser Phe Tyr Asp Ala Asn Lys Asp Gly His 
Gly Asp Leu Lys Gly Leu Thr Gin Lys Leu Asp Tyr Leu Asn Asp Gly Asn Ser His Thr Lys Asn 
Asp Leu Gin Val Asn Gly lie Trp Met Met Pro Val Asn Pro Ser Pro Ser Tyr His Lys Tyr Asp Val 
Thr Asp Tyr Tyr Asn He Asp Pro Gin Tyr Gly Asn Leu Gin Asp Phe Arg Lys Leu Met Lys Glu Ala 
Asp Lys Arg Asp Val Lys Val He Met Asp Leu Val Val Asn His Thr Ser Ser Glu His Pro Trp Phe 
Gin Ala Ala Leu Lys Asp Lys Asn Ser Lys Tyr Arg Asp Tyr Tyr He Trp Ala Asp Lys Asn Thr Asp 
Leu Asn Glu Lys Gly Ser Trp Gly Gin Ghi Val Trp His Lys Ala Pro Asn Gly Glu Tyr Phe Tyr Gly 
Thr Phe Trp Glu Gly Met Pro Asp Leu Asn Tyr Asp Asn Pro Glii Val Arg Lys Glu Met He Asn Val 
Gly Lys Phe Trp Leu Lys Gin Gly Val Asp Gly Phe Arg Leu Asp Ala Ala Leu His lie Phe Lys Gly 
Gin Thr Ala Glu Gly Ala Lys Lys Asn He Leu Trp Trp Asn Glu Phe Arg Asp Ala Met Lys Lys Glu 
Asn Pro Asn Val Tyr Leu Thr Gly Glu Val Tip Asp Gin Pro Glu Val Val Ala Pro Tyr Tyr Gin Ser 
Leu Asp Ser Leu Phe Asn Phe Asp Leu Ala Gly Lys He Val Ser Ser Val Lys Ala Gly Asn Asp Gin 
Gly He Ala Thr Ala Ala Ala Ala Thr Asp Glu Leu Phe Lys Ser Tyr Asn Pro Asn Lys He Asp Gly 
He Phe Leu Thr Asn His Asp Gin Asn Arg Val Met Ser Glu Leu Ser Gly Asp Val Ser Lys Ala Lys 
Ser Ala Ala Ser He Leu Leu Thr Leu Pro Gly Asn Pro Tyr lie Tyr Tyr Gly Glu Glu He Gly Met Thr 
Gly Glu Lys Pro Asp Glu Leu He Arg Glu Pro Phe Arg Trp Tyr Glu Gly Asn Gly Leu Gly Gin Thr 
Ser Tip Glu Thr Pro Val Tyr Asn Lys Gly Gly Asn Gly Val Ser Val Glu Ala Gin Thr Lys Gin Lys 
Asp Ser Leu Leu Asn His Tyr Arg Glu Met He Arg Val Arg Gin Gin His Glu Glu Leu Val Lys Gly 
Thr Leu Gin Ser He Ser Val Asp Ser Lys Glu Val Val Ala Tyr Ser Arg Thr Tyr Lys Gly Asn Ser He 
Ser Val Tyr His Asn He Ser Asn Gin Pro Val Lys Val Ser Val Ala Ala Lys Gly Lys Leu He Phe Ala 
Ser Glu Lys Gly Ala Lys Lys Val Lys Asn Gin Leu Val Val Pro Ala Asn Thr Thr Val Leu Met Lys 

SEQID NO: 193 

atgaaattcaaaaagagtttatctgccgggctccttttgttcggaggtctgagcggtgtgacaccatccgtcgctgcggaggtgccacgaaccgc 

atttgtccatttattcgaatggagttggccggatattgccaccgaatgcgaaacctttcttggccctaaggggttctctgcggttcaggtgtctccgc 

cgcaaaaaagcgtcagcaatgctgcctggtgggcgcgctaccaacctgttagttactcttttgaagggcgcagtggaacccgggctcaatttgc 

ggatatggtccagcgttgtaaagcggtgggggtcgatatttatctggatgcggtgatcaaccatatggcagcacaagatcgctattttccagaagt 

accttacagcagtaatgattttcacagttgcacgggcgatatcgattattccaaccgctggtcgattcaaaattgcgatctggttgggctgaacgat 

ctcaaaaccgagtcagaatacgttcggcagaaaattgcagactatatgaacgatgcgctcagtctgggcgtggcggggtttcggattgatgccg 

ccaagcatatcccggccggcgacatcgcggcgatcaagagcaagctcaacggcagcccgtatatctatcaggaggttatcggggcggcagg 

ggagccggtacaaaccagcgagtacacgtatattggagacgtgacggaatttaacttcgcccggaccatcgggcctaaatttaagcaaggtaat 

attaaagacctgcaggggattggttcgtggagcggctggctgagcagcgacgatgcggtgacctttgtgaccaaccatgacgaagaacgcca 

taaccctggccaggttctcagccatcaggactttggcaatctgtatttcctcggtaacgtgtttactctggcgtatccttacggctacccaaa^ 

tgtcggggtactacttcagtaattttgatgccgggccaccatcgacaggggtacattctggtaatgcgtgtggctttgatggcggtgattggg^^ 

cgaacacaaatggcgtggtgtagccaacatggtggcgtttcgcaaccacacagcagcccagtggcaggtcactgactggtgggacgatggtt 

acaatcaggtggcgtttggtcgtggcgggctgggctttgtggtgatcaatcgagatgacaataaaggcatcaatcagagtttccagacgggaat 

gcccgctggcgagtattgtgacatcattgccggtgatttcgacacccagagcggtcattgcagcgctacgacgatcaccgtcgacagtcaggg 

gtatgcacattttactgtcggtagtcatcaggccgctgcgattcacattggcgcgaaactcggctccgtgtgccaggactgtggcggcacggcc 

gcagagacaaaagtctgctttgacaatgcacaaaactttagccaaccgtatttgcattactggaatgtcaatgcggatcaggccgtagcgaatgc 

aacctggccgggcgtcgcgatgacggctgaaaatggcggttactgctacgattttggtgtcggtctcaattcacttcaggtaattttcagcgata^ 

cggcgccagccaaaccgctgatctgaccgccagcagtccgacgttgtgttaccagaacggaacgtggcgtgacagtgacttctgtcagagta 

gcaatgtgggcaacgagagttggtatttccgtggaacctcaaacggttggggcgtgagcgcactcacttatgaggctgcgacaggcctgtaca 

ctacggtgcagagctttaacggggaggagtcgcccgcacgctttaaaattgatgatggcaactggagtgagtcgtatccaagtgctgattatcaa 

gtcggtgattatgccacctacacgatcacgtttgacagccagacgaaggccatcaccgtgacttcgcagtaa 

SEQID NO: 194 

Met Lys Phe Lys Lys Ser Leu Ser Ala Gly Leu Leu Leu Phe Gly Gly Leu Ser Gly Val Thr Pro Ser 
Val Ala Ala Glu Val Pro Arg Thr Ala Phe Val His Leu Phe Glu Trp Ser Trp Pro Asp lie Ala Thr 
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Glu Cys Glu Thr Phe Leu Gly Pro Lys Gly Phe Ser Ala Val Gin Val Ser Pro Pro Gin Lys Ser Val 
Ser Asn Ala Ala Trp Trp Ala Arg Tyr Gin Pro Val Ser Tyr Ser Phe Glu Gly Arg Ser Gly Thr Arg 
Ala Gin Phe Ala Asp Met Val Gin Arg Cys Lys Ala Val Gly Val Asp He Tyr Leu Asp Ala Val He 
Asn His Met Ala Ala Gin Asp Arg Tyr Phe Pro Glu Val Pro Tyr Ser Ser Asn Asp Phe His Ser Cys 
Thr Gly Asp He Asp Tyr Ser Asn Arg Trp Ser He Gin Asn Cys Asp Leu Val Gly Leu Asn Asp Leu 
Lys Thr Glu Ser Glu Tyr Val Arg Gin Lys He Ala Asp Tyr Met Asn Asp Ala Leu Ser Leu Gly Val 
Ala Gly Phe Arg He Asp Ala Ala Lys His He Pro Ala Gly Asp He Ala Ala He Lys Ser Lys Leu Asn 
Gly Ser Pro Tyr He Tyr Gin Glu Val He Gly Ala Ala Gly Glu Pro Val Gin Thr Ser Glu Tyr Thr Tyr 
He Gly Asp Val Thr Glu Phe Asn Phe Ala Arg Thr He Gly Pro Lys Phe Lys Gin Gly Asn He Lys 
Asp Leu Gin Gly He Gly Ser Trp Ser Gly Trp Leu Ser Ser Asp Asp Ala Val Thr Phe Val Thr Asn 
His Asp Glu Glu Arg His Asn Pro Gly Gin Val Leu Ser His Gin Asp Phe Gly Asn Leu Tyr Phe Leu 
Gly Asn Val Phe Thr Leu Ala Tyr Pro Tyr Gly Tyr Pro Lys Val Met Ser Gly Tyr Tyr Phe Ser Asn 
Phe Asp Ala Gly Pro Pro Ser Thr Gly Val His Ser Gly Asn Ala Cys Gly Phe Asp Gly Gly Asp Tip 
Val Cys Glu His Lys Trp Arg Gly Val Ala Asn Met Val Ala Phe Arg Asn His Thr Ala Ala Gin Tip 
Gin Val Thr Asp Trp Trp Asp Asp Gly Tyr Asn Gin Val Ala Phe Gly Arg Gly Gly Leu Gly Phe 
Val Val He Asn Arg Asp Asp Asn Lys Gly He Asn Gin Ser Phe Gin Thr Gly Met Pro Ala Gly Glu 
Tyr Cys Asp He He Ala Gly Asp Phe Asp Thr Gin Ser Gly His Cys Ser Ala Thr Thr He Thr Val 
Asp Ser Gin Gly Tyr Ala His Phe Thr Val Gly Ser His Gin Ala Ala Ala He His He Gly Ala Lys Leu 
Gly Ser Val Cys Gin Asp Cys Gly Gly Thr Ala Ala Glu Thr Lys Val Cys Phe Asp Asn Ala Gin 
Asn Phe Ser Gin Pro Tyr Leu His Tyr Tip Asn Val Asn Ala Asp Gin Ala Val Ala Asn Ala Thr Trp 
Pro Gly Val Ala Met Thr Ala Glu Asn Gly Gly Tyr Cys Tyr Asp Phe Gly Val Gly Leu Asn Ser Leu 
Gin Val He Phe Ser Asp Asn Gly Ala Ser Gin Thr Ala Asp Leu Thr Ala Ser Ser Pro Thr Leu Cys 
Tyr Gin Asn Gly Thr Trp Arg Asp Ser Asp Phe Cys Gin Ser Ser Asn Val Gly Asn Glu Ser Tip Tyr 
Phe Arg Gly Thr Ser Asn Gly Trp Gly Val Ser Ala Leu Thr Tyr Glu Ala Ala Thr Gly Leu Tyr Thr 
Thr Val Gin Ser Phe Asn Gly Glu Glu Ser Pro Ala Arg Phe Lys He Asp Asp Gly Asn Tip Ser Glu 
Ser Tyr Pro Ser Ala Asp Tyr Gin Val Gly Asp Tyr Ala Thr Tyr Thr He Thr Phe Asp Ser Gin Thr 
Lys Ala He Thr Val Thr Ser Gin 

SEQIDNO: 195 

atgctgacagaccgtttctttgatggcgatacatcaaacaacgacccttacaaccagaactacgatgctaaaaacgaccggggaacttatcagg 

gcggcgattttaaaggaatcacgcaaaaattggattatctcgataagctaggcgtgaacacaatctggatcagcccgatcgtggaaaatatcaag 

catgatgtccgttatgacaactctgaagggcattcatactatgcttaccacggctactgggcaagcaacttcggtgcgttaaacccacacttcggt 

acaatggaagatttccatacactgattgacgctgcccatgaaaaaggcatcaagatcatggttgacgtagtattaaaccacactggttatggctta 

aaagatatcaacggagaagtttccaatcctccagccggttacccaactgacgcagaacgcagcacatatagcagcctgcttcgccagggttca 

aatgtcggctctgatgaggttgttggcgaattagctggcctacctgacttaaaaacagaagaccccgcagtccgccagacaatcatcgactggc 

aaacagactggatcacgaaagctactacagctaaaggaaacacaattgactacttccgtgtcgacactgtgaagcacgttgaagacgcaacat 

ggatggcattcaaaaatgacctcactgaaaaaatgccgacacacaaaatgatcggggaagcttggggagcaagtgccaataaccaacttggat 

accttgaaacaggtatgatggactcactgcttgacttcgacttcaaaggcattgcgcacgatttcgtgaacggcaagcttaaggcagcaaacgat 

gccctgactgcccgcaacggtaaaattgacaacacagctactttaggttcattccttggaagccatgacgaagatggtttcctatttaaagaa^^ 

aatgacaaaggcaagcttaaggttgctgcttccctgcaagcaacatcaaaaggccagccggtcatctattatggtgaagagcttggtcaaagtg 

gagcaaacaactatccgcaatacgataaccgttatgacctggcatgggacaaagttgaaaacaacgacgtccttgagcactacactaaggtcct 

gaacttcagaagcgctcattcagaagtgttcgctaaaggtgaacgcgcaacaattggcggttctgacgctgataaattcttactttttgctcgtaaa 

aatggaaacgaagctgcttacgtcggcttgaacgttgctgacacagcaaaagacgtaacactgactgtttctgcaggtgcagtcgtaactgacc 

actatgcagataaaacttatactgcttcagaagctggagaaatcacattgacgatcccggcaaaagctgatggcggtactgttttactaacggttg 

aaggcggagaaatcacagctgctaaagcggcaagcgaaggcgacggcacagttgagccagtccctgcgaaccacatccgcattcactacaa 

ccgtacagacaacaactatgaaaactacggtgcatggctgtggaacgatgtagcctccccttctgccaactggccgactggcgctacaatgtttg 

aaaaaacagacagctacggtgcatacatcgacgtaccacttaaagagggcgctaagaacatcggcttcctcgttatggatgtaacaaaaggtga 

tcagggtaaagacggcggcgacaaaggttttacgatctcatcacctgaaatgaacgaaatttggatcaagcaaggttctgacaaggtgtacactt 

acgagccagttgatcttccggcgaacactgtccgcgtccactatgtacgtgacaacgcagactacgaaaacttcggtatctggaactggggcga 

tgtaacagcaccttccgaaaactggcctacaggcgcagcgaaattcgatggtacagaccgttacggtgcgtatgtcgacattacgctaaaagaa 
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camggattaaacaaggtgatgacaatgtctacgtttctccatactgggagcaggcaacaggaatcaccaatgcagaggtaatctctgaagata 

cgattctattaggcttcacaatgactgacggcttaacacctgaatctttaaaaggaggtcttgtaattaaagattcaactggt^^ 

gaaagtgctgaaatcacaagcgcaacctctgtaaaagtaaaagcaacattcgatttagaaaagcttccattatccatca^ 

agtttcagcttcaactggctggagaatgcttgatgaaatgtacgcttatgatggaaacgaccttggtgcgacttacaaggacggagcagcgacg 

cttaaattatgggctccgaaagcgagcaaggtaaccgctaacUcmgataaaaataatgccgctgaaaaaatcggcagcgtcgagttaacga 



cggtataactcgccaggtgttagatccttatgcaaaatcaatggcagcctttactgtgaatacagaaggcaatgctggtcctgacggggac^^ 
ttggcaaggcggcaattcaaaaagcttctcgagagtacttctag 

SEQIDNO:196 

Met Leu Thr Asp Arg Phe Phe Asp Gly Asp Thr Ser Asn Asn Asp Pro Tyr Asn Gin Asn Tyr Asp 
Ala Lys Asn Asp Arg Gly Thr Tyr Gin Gly Gly Asp Phe Lys Gly He Thr Gin Lys Leu Asp Tyr Leu 
Asp Lys Leu Gly Val Asn Thr He Trp He Ser Pro He Val Glu Asn He Lys His Asp Val Arg Tyr Asp 
Asn Ser Glu Gly His Ser Tyr Tyr Ala Tyr His Gly Tyr Tip Ala Ser Asn Phe Gly Ala Leu Asn Pro 
His Phe Gly Thr Met Glu Asp Phe His Thr Leu He Asp Ala Ala His Glu Lys Gly He Lys He Met 
Val Asp Val Val Leu Asn His Thr Gly Tyr Gly Leu Lys Asp He Asn Gly Glu Val Ser Asn Pro Pro 
Ala Gly Tyr Pro Thr Asp Ala Glu Arg Ser Thr Tyr Ser Ser Leu Leu Arg Gin Gly Ser Asn Val Gly 
Ser Asp Glu Val Val Gly Glu Leu Ala Gly Leu Pro Asp Leu Lys Thr Glu Asp Pro Ala Val Arg Gin 
Thr He He Asp Tip Gin Thr Asp Trp He Thr Lys Ala Thr Thr Ala Lys Gly Asn Thr He Asp Tyr Phe 
Arg Val Asp Thr Val Lys His Val Glu Asp Ala Thr Tip Met Ala Phe Lys Asn Asp Leu Thr Glu 
Lys Met Pro Thr His Lys Met He Gly Glu Ala Trp Gly Ala Ser Ala Asn Asn Gin Leu Gly Tyr Leu 
Glu Thr Gly Met Met Asp Ser Leu Leu Asp Phe Asp Phe Lys Gly He Ala His Asp Phe Val Asn Gly 
Lys Leu Lys Ala Ala Asn Asp Ala Leu Thr Ala Arg Asn Gly Lys He Asp Asn Thr Ala Thr Leu Gly 
Ser Phe Leu Gly Ser His Asp Glu Asp Gly Phe Leu Phe Lys Glu Gly Asn Asp Lys Gly Lys Leu 
Lys Val Ala Ala Ser Leu Gin Ala Thr Ser Lys Gly Gin Pro Val He Tyr Tyr Gly Glu Glu Leu Gly 
Gin Ser Gly Ala Asn Asn Tyr Pro Gin Tyr Asp Asn Arg Tyr Asp Leu Ala Trp Asp Lys Val Glu 
Asn Asn Asp Val Leu Glu His Tyr Thr Lys Val Leu Asn Phe Arg Ser Ala His Ser Glu Val Phe Ala 
Lys Gly Glu Arg Ala Thr He Gly Gly Ser Asp Ala Asp Lys Phe Leu Leu Phe Ala Arg Lys Asn Gly 
Asn Glu Ala Ala Tyr Val Gly Leu Asn Val Ala Asp Thr Ala Lys Asp Val Thr Leu Thr Val Ser Ala 
Gly Ala Val Val Thr Asp His Tyr Ala Asp Lys Thr Tyr Thr Ala Ser Glu Ala Gly Glu He Thr Leu 
Thr He Pro Ala Lys Ala Asp Gly Gly Thr Val Leu Leu Thr Val Glu Gly Gly Glu He Thr Ala Ala 
Lys Ala Ala Ser Glu Gly Asp Gly Thr Val Glu Pro Val Pro Ala Asn His He Arg He His Tyr Asn 
Arg Thr Asp Asn Asn Tyr Glu Asn Tyr Gly Ala Trp Leu Trp Asn Asp Val Ala Ser Pro Ser Ala Asn 
Trp Pro Thr Gly Ala Thr Met Phe Glu Lys Thr Asp Ser Tyr Gly Ala Tyr He Asp Val Pro Leu Lys 
Glu Gly Ala Lys Asn He Gly Phe Leu Val Met Asp Val Thr Lys Gly Asp Gin Gly Lys Asp Gly Gly 
Asp Lys Gly Phe Thr He Ser Ser Pro Glu Met Asn Glu He Trp He Lys Gin Gly Ser Asp Lys Val 
Tyr Thr Tyr Glu Pro Val Asp Leu Pro Ala Asn Thr Val Arg Val His Tyr Val Arg Asp Asn Ala Asp 
Tyr Glu Asn Phe Gly He Trp Asn Trp Gly Asp Val Thr Ala Pro Ser Glu Asn Trp Pro Thr Gly Ala 
Ala Lys Phe Asp Gly Thr Asp Arg Tyr Gly Ala Tyr Val Asp He Thr Leu Lys Glu Gly Ala Lys Asn 
He Gly Met He Ala Leu Asn Thr Ala Asn Gly Glu Lys Asp Gly Gly Asp Lys Ser Phe Asn Leu Leu 
Asp Lys Tyr Asn Arg He Trp He Lys Gin Gly Asp Asp Asn Val Tyr Val Ser Pro Tyr Trp Glu Gta 
Ala Thr Gly He Thr Asn Ala Glu Val He Ser Glu Asp Thr He Leu Leu Gly Phe Thr Met Thr Asp 
Gly Leu Thr Pro Glu Ser Leu Lys Gly Gly Leu Val He Lys Asp Ser Thr Gly Ala Glu Val Ala He 
Glu Ser Ala Glu He Thr Ser Ala Thr Ser Val Lys Val Lys Ala Thr Phe Asp Leu Glu Lys Leu Pro 
Leu Ser He Thr Tyr Ala Gly Arg Thr Val Ser Ala Ser Thr Gly Trp Arg Met Leu Asp Glu Met Tyr 
Ala Tyr Asp Gly Asn Asp Leu Gly Ala Thr Tyr Lys Asp Gly Ala Ala Thr Leu Lys Leu Trp Ala Pro 
Lys Ala Ser Lys Val Thr Ala Asn Phe Phe Asp Lys Asn Asn Ala Ala Glu Lys He Gly Ser Val Glu 
Leu Thr Lys Gly Glu Lys Gly Val Trp Ser Ala Met Val Ala Pro Gly Asp Leu Asn Val Thr Asp Leu 
Glu Gly Tyr Phe Tyr Gin Tyr Asp Val Thr Asn Asp Gly He Thr Arg Gin Val Leu Asp Pro Tyr Ala 
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Lys Sei Met Ala Ala Phe Thr Val Asn Thr Glu Gly Asn Ala Gly Pro Asp Gly Asp Thr Val Gly Lys 
Ala Ala He Gin Lys Ala Ser Arg Glu Tyr Phe 

SEQIDNO:197 

atgaaaccgtcaaaattcgtttttctctctgctgccatcgcttgcagcctctccagtaccgccaatgctgacgccatmgcatgcattt^ 

tactccgacgtcacgcaaaacgcctcgcaaatcgcggcggcgggttataaaaaagtgctgatttcgccagcactgaaatcgagtggcaatgaa 

tggtgggcacgnatcaaccgcaagatctgcgcgtgatcgattccccacttggcaacaaaagtgacttaaaatccat^^ 

ggtcggcgttgatgtgtatgccgatgtggtgcttaaccatatggccaatgaaacatggaagcgtgaagacttaaattaccctggcagtgaagtgc 

tgcaacaatacgcagctaacaccagttattatgcggaccaaacgcttmggcaatttaacggaaaacctattctctggcWgacttccac^^ 

aggctgtattagcgaUggaatgatgccggcaatgttcagtactggcgtcmgtggcggtgctggtgaccgagggct^^^ 

acaactgggtggtgtcacagcaacgmgtamgaatgcgctaaaaggtttaggtgtgaaaggcttccgcattgatgcggttaaacacatgagcc 

aatatcaaatcgaccagattttcactgcagagattaccgccggaatgcacgtgtttggtgaagtgatcaccagtggtggcaaaggcgactccag 

ctatgagaacttcttagcgccttatctcaacgccaccaaccattcggcttacgatttcccactgmgcctctattcgcaac^^ 

^gcatgaacatgcttcatgatccacaagcctatggccaagggcttgaaaacgcacgttcaattacctttaccatcacgcacgacatcccaacga 

acgacgg^ttccgttatcaaatcatggatccgaaagatgaagagctggcttacgcttatatcctcggtaaagatggcggcacacctctgatto 

gcgacaacttacctgataacgaagatcgjgataatcgccgttgggaaggtgmggaaccg^gacctgatgaagaacatgttgcgctt^^ 

caaatgcaagggcaagagatgacgatgctgtacagcgaccaatgtctactgatgtttaagcgcggtaaacaaggggtggtcggcattaataaat 

gcgglgaagagcgttctcataccgttgacacctatcagcatgagttcaactggtatcagccttacacagatacactcactggcgtgactgaaacc 

gtgagttcgcgttaccacaccttccgaattccagctcgcagcgcgcgcatgtacatgctctaa 

SEQIDNO: 198 

Met Lys Pro Ser Lys Phe Val Phe Leu Ser Ala Ala He Ala Cys Ser Leu Ser Ser Thr Ala Asn Ala 
Asp Ala He Leu His Ala Phe Asn Tip Lys Tyr Ser Asp Val Thr Gin Asn Ala Ser Gin He Ala Ala 
Ala Gly Tyr Lys Lys Val Leu He Ser Pro Ala Leu Lys Ser Ser Gly Asn Glu Tip Trp Ala Arg Tyr 
Gin Pro Gin Asp Leu Arg Val He Asp Ser Pro Leu Gly Asn Lys Ser Asp Leu Lys Ser Met He Asp 
Ala Leu Lys Ala Val Gly Val Asp Val Tyr Ala Asp Val Val Leu Asn His Met Ala Asn Glu Thr Trp 
Lys Arg Glu Asp Leu Asn Tyr Pro Gly Ser Glu Val Leu Gin Gin Tyr Ala Ala Asn Thr Ser Tyr Tyr 
Ala Asp Gin Thr Leu Phe Gly Asn Leu Thr Glu Asn Leu Phe Ser Gly Phe Asp Phe His Pro Glu 
Gly Cys He Ser Asp Trp Asn Asp Ala Gly Asn Val Gin Tyr Trp Arg Leu Cys Gly Gly Ala Gly Asp 
Arg Gly Leu Pro Asp Leu Asp Pro Asn Asn Tip Val Val Ser Gin Gin Arg Leu Tyr Leu Asn Ala 
Leu Lys Gly Leu Gly Val Lys Gly Phe Arg lie Asp Ala Val Lys His Met Ser Gin Tyr Gin He Asp 
Gin He Phe Thr Ala Glu He Thr Ala Gly Met His Val Phe Gly Glu Val He Thr Ser Gly Gly Lys Gly 
Asp Ser Ser Tyr Glu Asn Phe Leu Ala Pro Tyr Leu Asn Ala Thr Asn His Ser Ala Tyr Asp Phe Pro 
Leu Phe Ala Ser He Arg Asn Ala Phe Ser Tyr Ser Gly Gly Met Asn Met Leu His Asp Pro Gin Ala 
Tyr Gly Gin Gly Leu Glu Asn Ala Arg Ser He Thr Phe Thr He Thr His Asp He Pro Thr Asn Asp 
Gly Phe Arg Tyr Gin He Met Asp Pro Lys Asp Glu Glu Leu Ala Tyr Ala Tyr He Leu Gly Lys Asp 
Gly Gly Thr Pro Leu He Tyr Ser Asp Asn Leu Pro Asp Asn Glu Asp Arg Asp Asn Arg Arg Trp 
Glu Gly Val Trp Asn Arg Asp Leu Met Lys Asn Met Leu Arg Phe His Asn Gin Met Gin Gly Gin 
Glu Met Thr Met Leu Tyr Ser Asp Gin Cys Leu Leu Met Phe Lys Arg Gly Lys Gin Gly Val Val 
Gly He Asn Lys Cys Gly Glu Glu Arg Ser His Thr Val Asp Thr Tyr Gin His Glu Phe Asn Trp Tyr 
Gin Pro Tyr Thr Asp Thr Leu Thr Gly Val Thr Glu Thr Val Ser Ser Arg Tyr His Thr Phe Arg He 
Pro Ala Arg Ser Ala Arg Met Tyr Met Leu 

SEQIDNO: 199 

gtgagtttgaccaaaaaggctcagtacgaaccaaatacggcaccaaggctcagtacatctctgcaatcaatgccgcgcacaacaacaatatcca 

aatttacggcgatgttgtgtttaaccaccgaggtggtgctgatgggaagtcgtgggtcgataccaagcgcgttgattgggacaaccgcaatattg 

aactgggcgacaaatggattgaagcttgggttgagtttaattttcctggccgcaacgacaaatactcgaacttccattggacttggtatcactttgac 

ggtgttgactgggatgacgccggcaaagaaaaagcgatctttaaattcaaaggcgaaggaaaagcatgggattgggaagtcagctctgaaaa 
aggcaattacgactacctaa 
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SEQIDNO:200 

Val Ser Leu Thr Lys Lys Ala Gin Tyr Glu Pro Asn Thr Ala Pro Arg Leu Ser Thr Ser Leu Gin Ser 
Met Pro Arg Thr Thr Thr He Ser Lys Phe Thr Ala Met Leu Cys Leu Thr Thr Glu Val Val Leu Met 
Gly Ser Arg Gly Ser He Pro Ser Ala Leu He Gly Thr Thr Ala He Leu Asn Tip Ala Thr Asn Gly Leu 
Lys Leu Gly Leu Ser Leu He Phe Leu Ala Ala Thr Thr Asn thr Arg Thr Ser He Gly Leu Gly He 
Thr Leu Thr Val Leu Thr Gly Met Thr Pro Ala Lys Lys Lys Arg Ser Leu Asn Ser Lys Ala Lys Glu 
Lys His Gly He Gly Lys Ser Ala Leu Lys Lys Ala He Thr Thr Thr 

SEQ ID NO: 201 

atgacagccaaggctgatgacttacgcatttaccagatcatggtggaaagctttgtggatggcgataaacaggtcggccatggcaccggc^^ 

gtaccagccatcacaaaggcgatctgcaagggatcattgactcgctggattacattcaatcgctgggcgtcaatgccatttggctaacgccgattt 

ttgaatctattccggtggagggacaagaccattgggcggacaggcttgatgctacaggctactttgccagtgactatttcaagatagaccctcgct 

ttggcacgttagaacaagcccgtgagctggtggaaaaggcacacgcgaaaggcttg^atgtcttcmgatggagtatttggtcaccataaa^^ 

aatgtggtgccatcaccacaaggtagactgcctgtcggtgaaaataacccggtcagctacccagagagcctggcgttttacgaagaagtcgcc 

agttactgggtgaaagagttaaagattgatggctggcgtctggatcaagcctatcaagtgccgaccgatgcatggaaagcgatccgtcagagc 

gttgatgaagcgtcacagtccgtaacttatgtgaataacaaaggggaaaccgtccatcctttgggttacatggtggctgaaatttggaataac^ 

cgttacatcacagaaaccggttacggcaaagaaggcgatccggcgttgtgctcggcttttgatmccgatgcgtttccgagtggtcgaaaccttt 

gcggttaacgaaagtggtgtcagccgaaaaggcggcgaatggttgaatgacggcatgtcactgcacagtcagtatccggatcatgccaagcct 

aatttaatgttgggcaaccatgatgtggtgcgctttggggatctgctgcaacgtggcggtattgcgtcaccagaacaaccgcaatactggcagcg 

tcataaagcggcgatgtctttcttagcagcgtataccggcccaattaccttgtattacggtgaagaaattggcgatcaggttgacggctttgctaaa 

aaaatcaaagaagattgtgccgttattggtttgtgtgatgaccacgtggcgcgcaccagtgcgaagattgatggcgtgacggcgtcactgaatg 

cacagcagtctgaactcaaagtatatgtctcttcattgatgacattacgtcagcaacatcctgcgttatcacaaggggaacgtactaatgtgatggc 

gacagagacagtatacgtagaccataaacaggcagacaatgaagccctgttgtacatggtgagtacgactgataacgcggagtcagtcacctt 

gaagggcaaagcgattggttcacaaggtgtgctgattgatttgttaacgaacgagcgttttatgcccaataatggggagtatgccattccattaac 

gggctttggcgcacgattcctcaagattgacactccgacagcggcgggtgtgatggcgcaatctgctgcctcggtatcgctagtaggtgaagg 

gatcatggcccaatgtgataccccaaccgttgaaggcaccggtccggtagcagaaaccttgtacgtggttggcgattttgccgatgctggttgga 

agcaaaagccgcagcgcgcgtatcaatacaaaggcaagcacaatggcagcaacttgtatcaagtggttgtcgatgaaaaagcgggcgcctac 

aagatgcaatacgccacgaaagattggagcccacagtttactgcagacggtatggcattgaagccgggtaccgcaaagtcgctcatagcgggt 

ggctacggtaaagacaccgccgtgacgttgccggaatccggtaagtatgtgtggagcttaacattcagtgatcttggcgagccggagcaaatc 

atggtgtctaagtgtcagtaa 

SEQ ID NO: 202 

Met Thr Ala Lys Ala Asp Asp Leu Arg He Tyr Gin He Met Val Glu Ser Phe Val Asp Gly Asp Lys 
Gin Val Gly His Gly Thr Gly Tyr Gly Thr Ser His His Lys Gly Asp Leu Gin Gly He He Asp Ser 
Leu Asp Tyr He Gin Ser Leu Gly Val Asn Ala He Trp Leu Thr Pro He Phe Glu Ser He Pro Val Glu 
Gly Gin Asp His Trp Ala Asp Arg Leu Asp Ala Thr Gly Tyr Phe Ala Ser Asp Tyr Phe Lys He Asp 
Pro Arg Phe Gly Thr Leu Glu Gin Ala Arg Glu Leu Val Glu Lys Ala His Ala Lys Gly Leu Tyr Val 
Phe Phe Asp Gly Val Phe Gly His His Lys Gly Asn Val Val Pro Ser Pro Gin Gly Arg Leu Pro Val 
Gly Glu Asn Asn Pro Val Ser Tyr Pro Glu Ser Leu Ala Phe Tyr Glu Glu Val Ala Ser Tyr Trp Val 
Lys Glu Leu Lys He Asp Gly Trp Arg Leu Asp Gin Ala Tyr Gin Val Pro Thr Asp Ala Trp Lys Ala 
He Arg Gin Ser Val Asp Glu Ala Ser Gin Ser Val Thr Tyr Val Asn Asn Lys Gly Glu Thr Val His 
Pro Leu Gly Tyr Met Val Ala Glu He Trp Asn Asn Glu Arg Tyr He Thr Glu Thr Gly Tyr Gly Lys 
Glu Gly Asp Pro Ala Leu Cys Ser Ala Phe Asp Phe Pro Met Arg Phe Arg Val Val Glu Thr Phe Ala 
Val Asn Glu Ser Gly Val Ser Arg Lys Gly Gly Glu Trp Leu Asn Asp Gly Met Ser Leu His Ser Gin 
Tyr Pro Asp His Ala Lys Pro Asn Leu Met Leu Gly Asn His Asp Val Val Arg Phe Gly Asp Leu 
Leu Gin Arg Gly Gly He Ala Ser Pro Glu Gin Pro Gin Tyr Trp Gin Arg His Lys Ala Ala Met Ser 
Phe Leu Ala Ala Tyr Thr Gly Pro He Thr Leu Tyr Tyr Gly Glu Glu He Gly Asp Gin Val Asp Gly 
Phe Ala Lys Lys He Lys Glu Asp Cys Ala Val He Gly Leu Cys Asp Asp. His Val Ala Arg Thr Ser 
Ala Lys He Asp Gly Val Thr Ala Ser Leu Asn Ala Gin Gin Ser Glu Leu Lys Val Tyr Val Ser Ser 
Leu Met Thr Leu Arg Gin Gin His Pro Ala Leu Ser Gin Gly Glu Arg Thr Asn Val Met Ala Thr Glu 
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Thr Val Tyr Val Asp His Lys Gin Ala Asp Asn Glu Ala Leu Leu Tyr Met Val Ser Thr Thr Asp Asn 
Ala Glu Ser Val Thr Leu Lys Gly Lys Ala He Gly Ser Gin Gly Val Leu He Asp Leu Leu Thr Asn 
Glu Arg Phe Met Pro Asn Asn Gly Glu Tyr Ala He Pro Leu Thr Gly Phe Gly Ala Arg Phe Leu Lys 
He Asp Thr Pro Thr Ala Ala Gly Val Met Ala Gin Ser Ala Ala Ser Val Ser Leu Val Gly Glu Gly He 
Met Ala Gin Cys Asp Thr Pro Thr Val Glu Gly Thr Gly Pro Val Ala Glu Thr Leu Tyr Val Val Gly 
Asp Phe Ala Asp Ala Gly Trp Lys Gin Lys Pro Gin Arg Ala Tyr Gin Tyr Lys Gly Lys His Asn Gly 
Ser Asn Leu Tyr Gin Val Val Val Asp Glu Lys Ala Gly Ala Tyr Lys Met Gin Tyr Ala Thr Lys Asp 
Trp Ser Pro Gin Phe Thr Ala Asp Gly Met Ala Leu Lys Pro Gly Thr Ala Lys Ser Leu He Ala Gly 
Gly Tyr Gly Lys Asp Thr Ala Val Thr Leu Pro Glu Ser Gly Lys Tyr Val Trp Ser Leu Thr Phe Ser 
Asp Leu Gly Glu Pro Glu Gin He 

SEQIDNO:203 

atgaagatgaagtcccgggcgtggttgttaggtagtgcagtggccatggcgttggcctcttcggcagccaatgccggtgtcatggttcaw^ 

ccagtggaagtacaatgacatcgccaacgagtgcgaaaaggtgctcggtcccaaagggtatgaagcagtgcagatcacgccgcctgctgaa 

cacctgcaaggctcctcctggtgggtggtctatcagcccgtcagctacaagaacttcacttctctgggcggtaacgaggccgaactcaaaagca 

tgatcgcccgttgcaaggccgccggggtcaagatttacgccgatgcggtattcaaccagctggctggtggatcaggcgtcggtacaggtggta 

gcagctacaatgccggcagcttcagctatccccaatttggctacaacgatttccatcacgctgggagcctcaccaactatgccgaccgcaacaa 

tgtgcaaaacggtgccctgctggggctgccggatctggataccggctctgcctatgtgcaggatcagctggctacctatatgaagaccctgagt 

ggctggggtgtggcaggttttcgtcttgatgcagcaaagcatatgagcgttgccgatctctcggccatcgtcagcaaggcgggcaatccttttgt 

ctactccgaggtgattggtgccacgggtgaaccaatccagccgggcgaatataccggcattggtgccgtgaccgaatttaaatacggcaccga 

tctggcctccaacttcaaggggcagatcaagaatctcaagagcatgggcgagagctggggtctgcttgcgtcgaacaaggctgaagtctttgtg 

gtcaaccatgaccgtgagcggggacatggcggtggcggtatgctgacctacaaggatggtgccctctacaatctggccaacatcttcatgctg 

gcctggccctatggcgcctatccccaggtgatgtccggctatgatttcggcaccaataccgatattggtgggccgagcgctaccccttgttcttcc 

ggctctagctggaactgcgaacaccgctggagcaacatcgccaacatggtctcgttccacaatgccgcccaaggcacgtccatgaccaactg 

gtgggataatggtaataaccagatcgcctttggtcgcggcgccaaggcctttgtggtgatcaacaatgaatcttccactctgagcaagagcctgc 

agacgggtctgccagccggggagtactgcaacattctggccggtgatgccctgtgcagcggcagcaccatcaaggtggatgccagcggtat 

ggccaccttcaacgtggcagggatgaaggcggcagcgatccatatcaatgccaagcccgatagcaccagcagtggcagctcaggctcttcct 

ctggctcttcttcctctgccaccagtaacaagtttgccagcatgaatctgcggggcaccaacaatggctgggccagcaccgccatgacagtgga 

tgccaaccgtgtctggtcggcggatgtcacctttaccggggccgcggatgccaatggtgcccagcgcttcaagtttgatgtctatggcaactgg 

acagagagctatggcgatacacaagccgatggcattgccgacaaggggagcgccaaggacatctatttcaatggtgtgggcaagtatcgtgtc 

tcgctcaaggagagcgacatgagctacaccctgacccagctctccagcaatcaggcaccggtggcggccatcacccccaagacactctccgt 

caagctgggtgactcagtggtgttcgatgcctccggctccaccgatgatgtgggtgtcactggctacagctggtctaccggtggcagtgccaag 

accgaaactgtgctgtttgatgctctgggtaccaagaccattaccgtgacagtggccgatgccgatggcttgacctccaaggccagtgccaccg 

tcaccgtcaccgatggcagcgtggcttataacagcaactttgccagcctgaacttccgtggcactcccaacagttggggcgcggcagccatga 

cgctggtggcagacaacacctgggaggcaacggtcaacftcgatggtcaggccaatcagcgcttcaagttcgatatcaagggtgactggagc 

cagaactatggtgatagcaacaaggatggggtggccgaacgtaccggtgccgatatttacaccactgtgaccggtcaatataaggtgcaa^ 

acgactccactttgaagtacaccctgaccaagctggccgatagcagcgccaccagctatagcgcgaactttgccagcctctacctgcgtggca 

ccccgaacagctggggcaccaccgccatgaagctggtggccaataacagctggcaggccgaggtgaccttcaccggcaagggcgatgcca 

ctggtgcccaacgcttcaagttcgacgtcaagggtgactggagccagaactacggtgacagcaacatggacgggactgccgaacggact^ 

tggcgatatcaccagtgccgtggtgggcacctatctggtgacctttaatgacagcacactgaaatacaccctgaccgccaaataa 

SEQ ID NO: 204 

Met Lys Met Lys Ser Arg Ala Trp Leu Leu Gly Ser Ala Val Ala Met Ala Leu Ala Ser Ser Ala Ala 
Asn Ala Gly Val Met Val His Leu Phe Gin Trp Lys Tyr Asn Asp He Ala Asn Glu Cys Glu Lys Val 
Leu Gly Pro Lys Gly Tyr Glu Ala Val Gin He Thr Pro Pro Ala Glu His Leu Gin Gly Ser Ser Trp 
Trp Val Val Tyr Gin Pro Val Ser Tyr Lys Asn Phe Thr Ser Leu Gly Gly Asn Glu Ala Glu Leu Lys 
Ser Met He Ala Arg Cys Lys Ala Ala Gly Val Lys He Tyr Ala Asp Ala Val Phe Asn Gin Leu Ala 
Gly Gly Ser Gly Val Gly Thr Gly Gly Ser Ser Tyr Asn Ala Gly Ser Phe Ser Tyr Pro Gin Phe Gly 
Tyr Asn Asp Phe His His Ala Gly Ser Leu Thr Asn Tyr Ala Asp Arg Asn Asn Val Gin Asn Gly 
Ala Leu Leu Gly Leu Pro Asp Leu Asp Thr Gly Ser Ala Tyr Val Gin Asp Gin Leu Ala Thr Tyr Met 
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Lys Thr Leu Ser Gly Trp Gly Val Ala Gly Phe Arg Leu Asp Ala Ala Lys His Met Ser Val Ala Asp 
Leu Ser Ala He Val Ser Lys Ala Gly Asn Pro Phe Val Tyr Ser Glu Val He Gly Ala Thr Gly Glu Pro 
He Gin Pro Gly Glu Tyr Thr Gly He Gly Ala Val Thr Glu Phe Lys Tyr Gly Thr Asp Leu Ala Ser 
Asn Phe Lys Gly Gin He Lys Asn Leu Lys. Ser Met Gly Glu Ser Trp Gly Leu Leu Ala Ser Asn Lys 
Ala Glu Val Phe Val Val Asn His Asp Arg Glu Arg Gly His Gly Gly Gly Gly Met Leu Thr Tyr Lys 
Asp Gly Ala Leu Tyr Asn Leu Ala Asn He Phe Met Leu Ala Tip Pro Tyr Gly Ala Tyr Pro Gin Val 
Met Ser Gly Tyr Asp Phe Gly Thr Asn Thr Asp He Gly Gly Pro Ser Ala Thr Pro Cys Ser Ser Gly 
Ser Ser Trp Asn Cys Glu His Arg Trp Ser Asn He Ala Asn Met Val Ser Phe His Asn Ala Ala Gin 
Gly Thr Ser Met Thr Asn Trp Trp Asp Asn Gly Asn Asn Ghi He Ala Phe Gly Arg Gly Ala Lys Ala 
Phe Val Val He Asn Asn Glu Ser Ser Thr Leu Ser Lys Ser Leu Gin Thr Gly Leu Pro Ala Gly Glu 
Tyr Cys Asn He Leu Ala Gly Asp Ala Leu Cys Ser Gly Ser Thr He Lys Val Asp Ala Ser Gly Met 
Ala Thr Phe Asn Val Ala Gly Met Lys Ala Ala Ala He His He Asn Ala Lys Pro Asp Ser Thr Ser 
Ser Gly Ser Ser Gly Ser Ser Ser Gly Ser Ser Ser Ser Ala Thr Ser Asn Lys Phe Ala Ser Met Asn Leu 
Arg Gly Thr Asn Asn Gly Tip Ala Ser Thr Ala Met Thr Val Asp Ala Asn Arg Val Trp Ser Ala Asp 
Val Thr Phe Thr Gly Ala Ala Asp Ala Asn Gly Ala Gin Arg Phe Lys Phe Asp Val Tyr Gly Asn Trp 
Thr Glu Ser Tyr Gly Asp Thr Gin Ala Asp Gly He Ala Asp Lys Gly Ser Ala Lys Asp He Tyr Phe 
Asn Gly Val Gly Lys Tyr Arg Val Ser Leu Lys Glu Ser Asp Met Ser Tyr Thr Leu Thr Gin Leu Ser 
Ser Asn Gin Ala Pro Val Ala Ala He Thr Pro Lys Thr Leu Ser Val Lys Leu Gly Asp Ser Val Val 
Phe Asp Ala Ser Gly Ser Thr Asp Asp Val Gly Val Thr Gly Tyr Ser Trp Ser Thr Gly Gly Ser Ala 
Lys Thr Glu Thr Val Leu Phe Asp Ala Leu Gly Thr Lys Thr He Thr Val Thr Val Ala Asp Ala Asp 
Gly Leu Thr Ser Lys Ala Ser Ala Thr Val Thr Val Thr Asp Gly Ser Val Ala Tyr Asn Ser Asn Phe 
Ala Ser Leu Asn Phe Arg Gly Thr Pro Asn Ser Trp Gly Ala Ala Ala Met Thr Leu Val Ala Asp Asn 
Thr Trp Glu Ala Thr Val Asn Phe Asp Gly Gin Ala Asn Gin Arg Phe Lys Phe Asp He Lys Gly Asp 
Trp Ser Gin Asn Tyr Gly Asp Ser Asn Lys Asp Gly Val Ala Glu Arg Thr Gly Ala Asp He Tyr Thr 
Thr Val Thr Gly Gin Tyr Lys Val Gin Phe Asn Asp Ser Thr Leu Lys Tyr Thr Leu Thr Lys Leu Ala 
Asp Ser Ser Ala Thr Ser Tyr Ser Ala Asn Phe Ala Ser Leu Tyr Leu Arg Gly Thr Pro Asn Ser Trp 
Gly Thr Thr Ala Met Lys Leu Val Ala Asn Asn Ser Trp Gin Ala Glu Val Thr Phe Thr Gly Lys Gly 
Asp Ala Thr Gly Ala Gin Arg Phe Lys Phe Asp Val Lys Gly Asp Trp Ser Gin Asn Tyr Gly Asp Ser 
Asn Met Asp Gly Thr Ala Glu Arg Thr Gly Gly Asp He Thr Ser Ala Val Val Gly Thr Tyr Leu Val 
Thr Phe Asn Asp Ser Thr Leu Lys Tyr Thr Leu Thr Ala Lys 

SEQ ID NO: 205 

atgtaccgcgtaatacctattattttgattatgagtatgattgtagcttgtgagtctccaaagaaaaaaacaaccgaaaccgctcaaccttcaa 

tgccgaaaaaccctttgtttgggaggctgccaatgtatattttttgttaactgaccgttttaacaacggtaacccaaacaatgacatcaattttaatag 

gactaaagaatcaggaaaactccgcaattttatgggaggcgatatcaagggcatcacccaaaaaataaatgaggggtattttagtaaactaggc 

gttaatgccatctggcttaccccggttgttgaacaaatacatggcagtgttgatgaaggtaccggcaatacctatgcctttcatggctattgggc^^ 

aagattggacaaacttagacccaaattttggcacaaaagaagaccttgccgaactggtggcaactgcccatgcaaaaggcatcaggatactttta 

gatgtggtaataaaccacaccggcccggtaaccgaccaagacccggtttggggagaagattgggtacgtacaggcccgcagtgtacctatga 

taattacaccaataccaccagttgcacgctggtagccaatttacctgatatacttacagaaagtaatgaaaatgtggccttaccaaccttm 

taaatggaaagccgaaggcagattagagcaagaactaaaagaacttgacgattttttttcccgcacaggccacccacgcgcaccccgcttttac 

attattaaatggcttaccgattacatccgagaatttggggtagatgggtttagggttgataccgtaaaacataccgaagaaacggtttgggccgag 

ttgtatgatgaagccgtaattgcttttgccgaatataaaaaagccaacccagacaaggtattggacgataatgaattttatatggtaggcgaagtgt 

acaactacggtatttccggcggaaggttctatgatttcggcgataaaaaggtggactattttgaccacggatttaaaagcctcatcaattttgaaatg 

aaatatgatgccaattttacctacgatacactttttaggaagtacgatacccttttgcataccaaacttaaaggcagaagtgtgctcaactacctctca 

tctcacgacgatggaagtccatttgataaaatgcggcaaaaaccatacgagtcggctacaaaattactgctcactccgggcgcatcccaaatttat 

tacggtgacgaaaccgccagaagccttaacatagaaggcgcacagggagatgctacgcttcgttcgtttatgaattgggaagagctcgcagaa 

gaccctgccaagcaaaaaatacttcagcattggcaaaaactgggcagtttcaggaacaaccaccccgcagttggtgccggaaggcacaaaac 

ccttggcaaaaagccgttttacacctttagcagggtttatcaaaaaaatggttttattgacaaagttgtggtagcattagatgcccctaaaggccaaa 

aacaaattaccgttaatggtgtttttgatgacggtacaaaacttgtagatgcctattcaggcaaagaaacctcagttaaaaatggtatcgtttcacttt 

cttctgaatttgatattgttttgttagaacaaaaataa 
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SEQIDNO:206 

Met Tyr Arg Val lie Pro He He Leu He Met Ser Met He Val Ala Cys Glu Ser Pro Lys Lys Lys Thr 
Thr Glu Thr Ala Gin Pro Ser Thr Asn Ala Glu Lys Pro Phe Val Trp Glu Ala Ala Asn Val Tyr Phe 
Leu Leu Thr Asp Arg Phe Asn Asn Gly Asn Pro Asn Asn Asp Me Asn Phe Asn Arg Thr Lys Glu 
Ser Gly Lys Leu Arg Asn Phe Met Gly Gly Asp He Lys Gly He Thr Gin Lys He Asn Glu Gly Tyr 
Phe Ser Lys Leu Gly Val Asn Ala He Trp Leu Thr Pro Val Val Glu Gin He His Gly Ser Val Asp 
Glu Gly Thr Gly Asn Thr Tyr Ala Phe His Gly Tyr Trp Ala Lys Asp Trp Thr Asn Leu Asp Pro Asn 
Phe Gly Thr Lys Glu Asp Leu Ala Glu Leu Val Ala Thr Ala His Ala Lys Gly He Arg He Leu Leu 
Asp Val Val He Asn His Thr Gly Pro Val Thr Asp Gbi Asp Pro Val Tip Gly Glu Asp Trp Val Arg 
Thr Gly Pro Gin Cys Thr Tyr Asp Asn Tyr Thr Asn Thr Thr Ser Cys Thr Leu Val Ala Asn Leu Pro 
Asp He Leu Thr Glu Ser Asn Glu Asn Val Ala Leu Pro Thr Phe Leu Leu Asp Lys Tip Lys Ala Glu 
Gly Arg Leu Glu Gin Glu Leu Lys Glu Leu Asp Asp Phe Phe Ser Arg Thr Gly His Pro Arg Ala Pro 
Arg Phe Tyr He He Lys Tip Leu Thr Asp Tyr He Arg Glu Phe Gly Val Asp Gly Phe Arg Val Asp 
Thr Val Lys His Thr Glu Glu Thr Val Trp Ala Glu Leu Tyr Asp Glu Ala Val He Ala Phe Ala Glu 
Tyr Lys Lys Ala Asn Pro Asp Lys Val Leu Asp Asp Asn Glu Phe Tyr Met Val Gly Glu Val Tyr 
Asn Tyr Gly He Ser Gly Gly Arg Phe Tyr Asp Phe Gly Asp Lys Lys Val Asp Tyr Phe Asp His Gly 
Phe Lys Ser Leu He Asn Phe Glu Met Lys Tyr Asp Ala Asn Phe Thr Tyr Asp Thr Leu Phe Arg Lys 
Tyr Asp Thr Leu Leu His Thr Lys Leu Lys Gly Arg Ser Val Leu Asn Tyr Leu Ser Ser His Asp Asp 
Gly Ser Pro Phe Asp Lys Met Arg Gin Lys Pro Tyr Glu Ser Ala Thr Lys Leu Leu Leu Thr Pro Gly 
Ala Ser Gin He Tyr Tyr Gly Asp Glu Thr Ala Arg Ser Leu Asn He Glu Gly Ala Gin Gly Asp Ala 
Thr Leu Arg Ser Phe Met Asn Trp Glu Glu Leu Ala Glu Asp Pro Ala Lys Gin Lys He Leu Gin His 
Trp Gin Lys Leu Gly Ser Phe Arg Asn Asn His Pro Ala Val Gly Ala Gly Arg His Lys Thr Leu Gly 
Lys Lys Pro Phe Tyr Thr Phe Ser Arg Val Tyr Gin Lys Asn Gly Phe He Asp Lys Val Val Val Ala 
Leu Asp Ala Pro Lys Gly Gin Lys Gin He Thr Val Asn Gly Val Phe Asp Asp Gly Thr Lys Leu Val 
Asp Ala Tyr Ser Gly Lys Glu Thr Ser Val Lys Asn Gly He Val Ser Leu Ser Ser Glu Phe Asp He 
Val Leu Leu Glu Gin Lys 

SEQ ID NO: 207 

ctgtcgactgagcctttcgttttgggctcgagactgactctcagcccaccccgcagtagctccagacggagtagccgtaatagccgttggccgg 

gtcgtgggcaggggcctcgaggtacacccacccgcttgagtccacccacttgtccacccagccgccgaggttgccggtgtactcgtggatgc 

acgctcccgcgaacttcggaacgtagacccaccttccggctttgcttgaggcgaggttgatgtatgttatcagtcccggcttgcttccgtagccgt 

ttctcacgaatatcagctcgtcgttgtcgtagtaaacgacgtcaglgcttcctccggccaggttgtcatgtatccagatgaggttcttgagcttatcct 

tgttgagccactcctcgtagtcgcggtagaatattgtcggctggccctcgtaggtgaggatgaacgcgtaggctggatacttgttccagattatat 

cggtgtcgtggtttgcaacgaaggttacggccttaaacgggtcgcggctgacgactgtgcccccgttcttgagggcctcgacgagtgcgggaa 

tgttcttgttgtcaaaggccgcgtccatcttgtagtagagcgggaagtcgaagaccttggcgccgctcgagtaggcccagttgaggagtgcatc 

aacgttggtgtcccagtactcgccaacggcccagccgccccaccagttgagccagtccttgacgacccacgctccgtggcccttcacgtagtc 

aaagcgccaggcatcaacgccgatgctccttaggtaggcggcgtagctctcatcgctcgcccagagccagtgctggtcccagctcttctcgtg 

ggctatg^ctgggaagcctccaaatgtgccctcgtcacagcacttgacctcgttggggtggaagtcgaggtagttggcagtatatttgcccgag^ 

ccacctttgagaagtccgtccaggtgtagtccccaacgaacgggttccactcgaggtctccgcctgcgcggtggtttatgacgatgtccgctatg 

acctttatgccgtaggcatgggccgtgtttatcatgttcacgagctcctgcttggagccaaagcgcgtctctaccgttcccttctggtcgtactcacc 

gaggtcaaagaagtcgtaggggtcgtagcccatcgaataggcgccgcccatgcccttgctcgccgggggaatccaaatggcggatattcccg 

cctcgtaccactccggtatcttgctcctgatggtgtcccaccagattcctccacctgggacgtcccagtagaaggcctgcattataacgccgccct 

cttccagctcggagtacttggccataagttacctcctactagtagattaaaa 

SEQ HD NO: 208 

Leu Ser Thr Glu Pro Phe Val Leu Gly Ser Arg Leu Thr Leu Ser Pro Pro Arg Ser Ser Ser Arg Arg 
Ser Ser Arg Asn Ser Arg Tip Pro Gly Arg Gly Gin Gly Pro Arg Gly Thr Pro Thr Arg Leu Ser Pro 
Pro Thr Cys Pro Pro Ser Arg Arg Gly Cys Arg Cys Thr Arg Gly Cys Thr Leu Pro Arg Thr Ser Glu 
Arg Arg Pro Thr Phe Arg Leu Cys Leu Arg Arg Gly Cys Met Leu Ser Val Pro Ala Cys Phe Arg 
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Ser Arg Phe Ser Arg He Ser Ala Arg Arg Cys Arg Ser Lys Arg Arg Gin Cys Phe Leu Arg Pro Gly 
Cys His Val Ser Arg Gly Ser Ala Tyr Pro Cys Ala Thr Pro Arg Ser Arg Gly Arg He Leu Ser Ala 
Gly Pro Arg Arg Gly Thr Arg Arg Leu Asp Thr Cys Ser Arg Leu Tyr Arg Cys Arg Gly Leu Gin 
Arg Arg Leu Arg Pro Thr Gly Arg Gly Arg Leu Cys Pro Arg Ser Gly Pro Arg Arg Val Arg Glu 
Cys Ser Cys Cys Gin Arg Pro Arg Pro Ser Cys Ser Arg Ala Gly Ser Arg Arg Pro Tip Arg Arg Ser 
Ser Arg Pro Ser Gly Val His Gin Arg Trp Cys Pro Ser Thr Arg Gin Arg Pro Ser Arg Pro Thr Ser 
Ala Ser Pro Arg Pro Thr Leu Arg Gly Pro Ser Arg Ser Gin Ser Ala Arg His Gin Arg Arg Cys Ser 
Leu Gly Arg Arg Arg Ser Ser His Arg Ser Pro Arg Ala Ser Ala Gly Pro Ser Ser Ser Arg Gly Leu 
Cys Leu Gly Ser Leu Gin Met Cys Pro Arg His Ser Thr Pro Arg Tip Gly Gly Ser Arg Gly Ser Trp 
Gin Tyr He Cys Pro Arg Pro Pro Leu Arg Ser Pro Ser Arg Cys Ser Pro Gin Arg Thr Gly Ser Thr 
Arg Gly Leu Arg Leu Arg Gly Gly Leu Arg Cys Pro Leu Pro Leu Cys Arg Arg His Gly Pro Cys 
Leu Ser Cys Ser Arg Ala Pro Ala Trp Ser Gin Ser Ala Ser Leu Pro Phe Pro Ser Gly Arg Thr His 
Arg Gly Gin Arg Ser Arg Arg Gly Arg Ser Pro Ser Asn Arg Arg Arg Pro Cys Pro Cys Ser Pro Gly 
Glu Ser Lys Trp Arg He Phe Pro Pro Arg Thr Thr Pro Val Ser Cys Ser Trp Cys Pro Thr Arg Phe 
Leu His Leu Gly Arg Pro Ser Arg Arg Pro Ala Leu Arg Arg Pro Leu Pro Ala Arg Ser Thr Trp Pro 
Val Thr Ser Tyr He Lys 

SEQIDNO:209 

atgattcagcccatgcactctcgggaacaggcctgccgtctcattccggcactgatcatgacatttgcactggcactgccgttgcaaattcgtgccg 

atgtcaccctgcatgcmcaactggagctatgccgatgtcgctgatcgggccgttgacatcgctgcagcagggtacagtgccgtgctggtggc 

cgccacttcgatccgaaggcacggcctggtgggcgcgataccagccccaggatctccgccttatcgaccatccgctgggcaatacacatgacttc 

gtcaacatgatcgatgctctcgatgatgtgggtgtgggcgtgtacgccgacatcgtgctcaaccacatggccaatgaggctgcacaaaggcct^ 

cctgaactaccctggtcaggcagtgcttgacgaatatgcttccgatcccggtcamcgagggcttgaggctgttcggtaatctgagcttcaatttcc^ 

gtcggaacatgatttcggacccgcccagtgcattcaggattacagcgatgtgtttcaggtccagaactggcggctgtgcggaccgccgccggacc 

cgggcctgcccgacctggtcgccaatgactgggtgatctctcaacagcgccagtatctggaagccatcaaggcgctgggtgtggctggcatgcg 

catcgacgcggtcaagcatatgcccatgagccatatcaatgccgttctcacccccgagatccggtcgggcttgcatgtgtttggcgaagtcatcacc 

tccggtggggctggtgatacatcctacgaccgtWctggccccttacctggcacaaagcgaccatggtgcctatgacmccattgtttgaaaccatt 

cgccgtgctttcggcttcggtggcagcatgagtgaactggtcgatcctgctgcctacggtcaggccctgccaccggaccgcgccatcaccttcgtc 

atcacgcacgatattccgaacaatgacggatttcgctaccagatactcgaccccgtcgatgaatcactggcctacgcctacattctgggccgcgatg 

gcggtgtcccgcttctgtattccgacaacaatgaaagcggcgatggccgctggatcgatgcctggcaacgtccggatctggttgcaatggtcggct 

tccacaatgcagtccacggtcaggacatggccgtgctttcacatgacgactgccacctgctgtttcggcgcggcagcctcgggattgtcggcatca 

acaagtgcggccatgcactcagctcctgggtcaacatgaaccagagcgtactgtggtggtacgcggactacacagacgtgctcgacagcaacag 

cgttgtcaacatccagtcatcctggcacgagttcatcxttcccgcccgccaggcacgcctgtggttgcga 

SEQroNO:210 

MIQPMHSREQACRLIPALIMTTALALPLQIRADVTLHAFNWSYADVADRAVDL\A^ 

VLVAPPLRSEGTAWWARYQPQDLRLIDHPLGNTHDFVNNDDALDDVGVGVYADIVLNHM 

ANEAAQRPDLNYPGQAVLDEYASDPGHFEGLRLFGNLSFNFLSEHDFGPAQCIQDYSDVF 

QVQNWRLCGPPPDPGLPDLVANDWVISQQRQYLEAIKALGVAGMRIDAVKHMPMSHINA 

VLTPEIRSGLHVFGEVITSGGAGDTSYDRFLAPYLAQSDHGAYDFPLFETIRRAFGFGGSMS 

ELVDPAAYGQALPPDRAITFVITHDIPNNDGFRYQILDPVDESLAYAYILGRDGGVPLLYSD 

NhnESGDGRWIDAWQRPDLVAMVGFHNAVHGQDMAVLSHDDCHLLFRRGSLGrVGHsIKC 

GHALSSWVNMNQSVLWWYADYTDVLDSNSWNIQSSWHEFILPARQARLWLR 

SEQIDN0:211 

GTGTTTCGTTCTGACACAGTTTCGCGTACCTGCATGTATGGTGCGCTGCGTAATGCCTA 

CCAACCCGATCGGGTGTTTACTGGAGTCACGGTGCGGACATGCAACTTAAAAAAGCAT 

GCTCATCGCCAGGCGCTGTTGTTCATCGTGACGCGGTGCCTGTGCCTGAAATCCAGGC 

AGACCCATAAAAACAACAACAAACCGATAACAAACGACCCAAGCCTTCTAAGAGGAG 

AAAACGGGATGGCTTTTAAACTACGCAAAAAGGCGCTCGTTGGCCTGTTCACGGCCGG 
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CGCAATGGTATATGCCGGTGCAGCGGCGAGTGGTGAAATCATTCTGCAGGGCTTCCAC 

TGGCACTCCAAGTGGGGCGGCAACAATCAGGGTTGGTGGCAGGTGATGGAAGGTCAG 

GCCAACACCATCGCCAACGCCGGCTTTACGCACGTGTGGTTCCCGCCGGTCCATAACT 

CGGCCGATGCCGAGGGTTACCTACCCCGCGAGCTGAACAACCTCAACTCCAGCTATGG 

CTCCGAAGCACAGCTGCGCAGCGCCATCCAGGCACTGAACAATCGCGGCGTGCATGCG 

ATTGCCGATGTGGTCATGAACCACCGGGTGGGCTGCTCtGGCTCGGCGGATTTCTCTA 

ACCCGGACTGGCCGACCTGGTACATCGTCGCCAATGATTCCTGGCCCGGTGGCCCGAA 

AAGCCAGAACTGGGACACGGGTGAGACGTACCACGCGGCCCGTGACCTCGATCACGC 

CAATCCGCAGGTGCGCAACGATATCTCGCACTACCTGAACAGCCGCCTCAAGGACGTC 

GGCTTCTCCGGCTGGCGCTGGGACTATGCCAAGGGTtTCTGGCCCGGCTATGTCGGCG 

AGTACAACTGGAACACCAACCCGAACTTCTGTGTGGGTGAGGTGTGGGACGATCTCGA 

CCCCAACAATCCCAACCCGCACCGCCAGCAACTGGTGGACTGGGTTGATCCTACCXjGT 

ggcagttgtcacgtctrcgacttcaccaccaaggggctgacgaactatgcgctgcagc 

atggccagtactggcgcctgcagggtgataatggtggcccggctggcggcatcggctg 

gtggccgcaacgcatggtgaccttcgtcgacaaccatgacacgggcccgagcaatcac 

tgtggtgacggccagaacctctggcccgtgccctgtgacaaggtcatcgaggcgtatg 

cctacatcctgacccatccgggcgtgccgtcggtgtactggacgcacttcttcaactgg 

aatcttggtagcgagatcagccagttgatgcagatccgcaagaaccagggcgtccact 

ccggttccgacgtctggatcgccgaggcccgtcacggcctgtacgccgcctatatcaa 

cggtaatgtggcgatgaagatgggctgggataactggagcccgggctggggctggtc 

gctggcggcctccggtaacaactgggccgtctggacacgctga 

SEQIDNO:212 

WRSDWSRTCMYGALRNAYQPDRVFTGVTNOITCNLKKHAHRQALLFIVTRCLCLKSRQT 

HKNNNKPIThTOPSLLRGENGMAFKLRKKALVGUTTAGAMWAGAAASGEnLQGFH^ 

KWGGNNQGWWQVMEGQANTIANAGFTHVWFPPVHNSADAEGYLPRELNNLNSSYGSEA 

QLRSAIQALNNRGVHAIADVVMNHRVGCSGWADFCM>DWPTWYIVANDSWPGGPKSQN 

WDTGETYHAARDLDHANPQVRNDISHYLNSRLKDVGFSGWRWDYAKGFWPGYVGEYN 

WNTNPhffCVGEXnVDDLDPhn^NPHRQQLVDWVDATGGSCHWDFTTKGLTNYALQHGQ 

YWRLQGDNGGPAGGIGWWPQRMVTFVD>fflDTGPShraCGIXjQNLWPVPCDKVMEAYA 

YILTHPGVPSVYWTHFFNAVNLGSEISQLMQIRKNQGVHSGSDVWIAEARHGLYAAYINGN 

VAMKMGWDNWSPGWGWSLAASGNNWAVWTR 



